REMARKS

Claims 1-3 and 5-9 are currently pending and under examination in the application. 
Claims 11 and 12 were previously withdrawn as being directed to non-elected subject matter.

Claims 1 and 5 have been amended solely to expedite patent prosecution in accordance 
with the U.S. Patent Office Business Goals (65 Fed. Reg. 54604 (September 8, 2000)). 
Applicant reserves the right to present any cancelled subject matter in a co-pending application. 

Amended claims 1 and 5 recite "protein-protein interaction" and "where the plurality of 
proteins are screened concurrently" (see, inter alia, page 1, lines 12-15; page 4, lines 10-12; page 
7, lines 9-12; page 9, lines 9-22 to page 10, lines 6-22; page 16, lines 3-10; page 18, lines 10-16; 
page 19, lines 1-20 to page 20, lines 1-8; and Example I, page 22, lines 1-19; and Examples II- 
IV, page 23, lines 3-22 of the application as originally filed). 

Newly added claim 13 recites "[a] method of correlating proteomic interaction(s) with 
oxygen tension comprising (a) screening for protein levels of a plurality of proteins, where the 
screening is performed in room air, and where the plurality of proteins are screened concurrently; 
(b) screening for protein levels of a plurality of proteins, where the screening is performed in the 
presence of decreased oxygen tension, and where the plurality of proteins are screened 
concurrently; and (c) correlating the protein level(s) with oxygen tension by identifying at least 
one different protein level between (a) and (b)" (see, inter alia, page 7, lines 6-12; and Example 
IV, page 23, lines 18-22 of the originally filed application).

Newly added claim 14 recites "[t]he method of Claim 13 where at least one protein 
employed in the determination is associated with a physiological process or a pathophysiological 
process" (see, inter alia, page 7, lines 6-12 of the application as originally filed). 

Newly added claim 15 recites "[t]he method of Claim 13 where a plurality of 
determinations are made in step (b) with different oxygen tensions being employed in each 
determination" (see, inter alia, page 20, lines 1-3 of the application as originally filed). 

Newly added claim 16 recites "[t]he method of Claim 15 where the oxygen tensions 
employed are in step (b) range from 0.1 mm Hg to 145 mm Hg" (see, inter alia, page 19, lines
19-20 of the application as originally filed). 

Newly added claim 17 recites "[t]he method of Claim 13 where the different protein 
levels in step (c) are used to identify protein functions associated with a pathophysiological 
process" (see, inter alia, page 20, lines 6-8 of the application as originally filed). 

These amendments are supported by the application as originally filed, and do not 
constitute new matter. Specific support for the amendments is shown in parentheses, above. 
Entry of these amendments in the application is respectfully requested. 

Previous Rejections 

The Examiner has withdrawn all previous rejections from the Office Action mailed July 
13, 2004 (Office Action, page 2). 

35 U.S.C. §112, First Paragraph 

Claims 1-3 and 5-9 have been rejected under 35 U.S.C. §112, first paragraph, as allegedly
failing to comply with the written description requirement (Office Action, page 3). The 
Examiner states that the claims contain subject matter which was not described in the 
specification in such a way as to reasonably convey to one skilled in the art that the inventor had
possession of the claimed invention at the time the application was filed. Id. The Examiner 
states that the phrase "a proteomic interaction between at least one protein and a plurality of 
proteins" includes new matter. Id. 

Applicant respectfully disagrees. 

The application specifically teaches that proteomic interactions (e.g., protein-protein 
interactions, protein-DNA interactions, protein activity, and protein expression levels) can be 
screened by several well-known means (see, inter alia, page 9, lines 3-22 to page 11, lines 1-8 of 
the originally filed application). However, solely for the purpose of expediting patent 
prosecution, claims 1 and 5 have been amended and claim 13 has been added as follows. 

1. A method of establishing a protein-protein interaction map comprising

(a) screening for a protein-protein interaction between at least one protein and a 
plurality of proteins, where the screening is performed in the absence of a 
simulated redox state perturbation, and where the plurality of proteins are 
screened concurrently; 

(b) screening for a protein-protein interaction between the at least one protein 
and a plurality of proteins, where the screening is performed in the presence of a 
simulated redox perturbation, and where the plurality of proteins are screened 
concurrently; and 

(c) generating the protein-protein interaction map by identifying at least one 
different protein-protein interaction between (a) and (b). 



5. A method of correlating protein-protein interaction(s) with oxygen tension 
comprising 

(a) screening for a protein-protein interaction between at least one protein and a 
plurality of proteins, where the screening is performed in room air, and where the 
plurality of proteins are screened concurrently; 

(b) screening for a protein-protein interaction between the at least one protein 
and a plurality of proteins, where the screening is performed in the presence of 
decreased oxygen tension, and where the plurality of proteins are screened 
concurrently; and 

(c) correlating the protein-protein interaction(s) with oxygen tension by 
identifying at least one different protein-protein interaction between (a) and (b). 



13. A method of correlating protein level(s) with oxygen tension comprising

(a) screening for protein levels of a plurality of proteins, where the screening is 
performed in room air, and where the plurality of proteins are screened 
concurrently; 

(b) screening for protein levels of a plurality of proteins, where the screening is 
performed in the presence of decreased oxygen tension, and where the plurality of 
proteins are screened concurrently; and 

(c) correlating the protein level(s) with oxygen tension by identifying at least 
one different protein level between (a) and (b). 



The Examiner states that the instant specification provides support only for protein- 
protein interactions measured by two-hybrid systems (Office Action, page 4). Yet, the 
application specifically teaches that protein-protein interactions and protein levels can be 
screened by several well-established techniques, including those published by: 

• Fung et al., 2001 (protein chips, microfluidic protein chips, peptide arrays, surface
plasmon resonance (SPR), surface-enhanced laser desorption/ionization (SELDI), and 
time of flight-mass spectroscopy (TOF-MS); Exhibit 1, see page 65, right column to page 
66, left column; and page 67, left column), 

• Delneri et al., 2001 (two-dimensional gel electrophoresis, multidimensional protein 
identification technique (MudPIT), matrix-assisted laser desorption/ionization mass 
spectroscopy (MALDI-MS), and isotope-coded affinity tagging (ICAT); Exhibit 2, see 
page 88, right column to page 89, left column), 

• Zhu et al., 2001 (proteome chips; Exhibit 3, see page 2101, right column to page 
2102, left and center columns), and 

• Sakurai et al., 1998 (orphan receptor cell lines; Exhibit 4, see page 573, right column, 
bottom) (see, inter alia, page 9, lines 3-22 to page 11, lines 1-8 of the originally filed 
application). 

For the Written Description requirement, it is noted that Applicant need not disclose in 
detail those aspects of the invention which are well known to persons of ordinary skill in the art. 
Hybritech Inc. v. Monoclonal Antibodies, Inc., 802 F.2d 1367, 1379-80, and 1384 (Fed. Cir. 
1986); MPEP § 2163. This is particularly true for mature technologies, where publications and 
patents are available to skilled artisans. See In re Hayes Microcomputer Products, Inc., 982 F.2d 
1527, 1534-35 (Fed. Cir. 1992); MPEP §2163; see, also, Exhibits 1-4. 

Moreover, it is not necessary for the application to describe exactly the claimed subject 
matter, but only to convey with reasonable clarity that Applicant was in possession thereof.
Union Oil Co. v. Atlantic Richfield Co., 208 F.3d 989, 997 (Fed. Cir. 2000). Here, Applicant's 
claims encompass methods of screening protein-protein interactions and protein levels and the 
instant application describes numerous well known and widely published approaches for 
performing such screens. 

For at least the reasons set forth above, claims 1-3 and 5-9 presented herein, as well as
newly added claims 13-17, are supported by the application as originally filed. Reconsideration 
is respectfully requested. 

35 U.S.C. §102(b)

Claims 1-3 have been rejected under 35 U.S.C. §102(b) as allegedly being anticipated by
Cominacini et al., 1997, Free Radical Biology & Medicine, 22:117-127 ("Cominacini"; Office
Action, page 4). According to the Office Action, Cominacini reports the expression levels of 
ICAM-1, VCAM-1, and E-selectin in the presence and absence of oxidized LDL protein (Office 
Action, page 5). 

Applicant respectfully traverses. 

Anticipation under 35 U.S.C. §102 requires that the cited reference teach every aspect of 
the claimed invention. Verdegaal Bros. v. Union Oil Co., 814 F.2d 628, 631 (Fed. Cir. 1987); 
MPEP §706.02 IV. Thus, "[t]he identical invention must be shown in as complete detail as 
contained in the... claim." Richardson v. Suzuki Motor Co., 868 F.2d 1226, 1236 (Fed. Cir. 
1989); MPEP §2131. 

Here, the claimed methods include i) screening for a protein-protein interaction between 
at least one protein and a plurality of proteins; ii) screening for protein-protein interaction(s) in 
room air and reduced oxygen tension; or iii) screening for protein levels of a plurality of proteins 
in room air and in reduced oxygen tension, none of which are taught or suggested by 
Cominacini.

For at least these reasons, Cominacini cannot anticipate the subject matter of claims 1-3 
or 5-9 presented herein, or newly added claims 13-17. Reconsideration is respectfully requested. 

35 U.S.C. §103(a) 

Claims 1-3 have been rejected under 35 U.S.C. §103(a) as allegedly being unpatentable in
view of Nishiyama et al., 1999, J. Biol. Chem., 274:21645-50 ("Nishiyama"; Office Action, page
6). According to the Office Action, Nishiyama reports the binding interaction between TRX and 
TBP-2 in the presence of redox modifiers. Id. The Examiner states that it would have been
obvious to one of skill in the art to characterize the binding interaction of TRX with other 
candidates. Id. The Examiner states that this characterization would merely involve the 
duplication of steps used for TRX and TBP. Id. 

Applicant respectfully traverses. 

As amended, claim 1 reads: 

1. A method of establishing a protein-protein interaction map comprising 

(a) screening for a protein-protein interaction between at least one protein and a 
plurality of proteins, where the screening is performed in the absence of a 
simulated redox state perturbation, and where the plurality of proteins are 
screened concurrently, 

(b) screening for a protein-protein interaction between the at least one protein 
and a plurality of proteins, where the screening is performed in the presence of a 
simulated redox perturbation, and where the plurality of proteins are screened 
concurrently, and 

(c) generating the protein-protein interaction map by identifying at least one 
different protein-protein interaction between (a) and (b). (Emphasis added). 

For analysis under 35 U.S.C. §103, it is essential to consider all of the elements of the 
claimed invention. W.L. Gore & Associates, Inc. v. Garlock, Inc., 721 F.2d 1540, 1548 (Fed. 
Cir. 1983); Jones v. Hardy, 727 F.2d 1524, 1530 (Fed. Cir. 1984); MPEP §2141.02. Each 
express claim limitation must be taken into account. See, e.g., Bausch & Lomb v. Barnes-
Hind/Hydrocurve, Inc., 796 F.2d 443, 447-49 (Fed. Cir. 1986); MPEP §2141.02.

Here, Applicant's claims encompass a method of screening for a protein-protein interaction
between at least one protein and a plurality of proteins, where the screening is performed in the 
presence and absence of a simulated redox state perturbation and where the plurality of proteins 
are screened concurrently. 

Yet, Nishiyama does not teach or suggest at least these elements in the pending claims. 
Nishiyama reports only the interaction of one protein (TRX) with one other protein (TBP-2) (see, 
e.g., Figure 3). Nishiyama does not teach or suggest the screening of at least one protein and a
plurality of proteins. Nor does Nishiyama teach or suggest the concurrent screening of the
plurality of proteins. As such, Nishiyama cannot make obvious the subject matter of the present
claims.

Inventor: Jonathan S. Stamler
U.S. Application Serial No. 09/977,693
Attorney Docket No.: 24862-503
(Former Docket No. Duke 1931)

According to the Office Action, Nishiyama shows that the interaction between TRX and
TBP-2 is inhibited by the presence of certain redox reagents (Office Action, page 6). However,
Nishiyama reports that TRX was treated with "reducing/oxidizing reagents for 15 min at room
temperature and then washed [five times] with degassed... buffer and subjected to a binding
assay..." (Nishiyama, see page 21646, right column, and Figure 3).

Thus, the reducing/oxidizing reagents are removed by Nishiyama by several rounds of
washing that precede the binding assay. In contrast, Applicant teaches and claims screens where
the screening is performed in the presence and absence of a simulated redox state perturbation
(see above). Because Nishiyama does not teach or suggest at least these elements in the pending
claims, it cannot make obvious the subject matter of these claims.

Moreover, although a claimed invention is alleged to be within the capabilities of one of 
ordinary skill in the art, this is not sufficient by itself to establish obviousness. MPEP §2143.01; 
In re Kotzab, 217 F.3d 1365, 1371 (Fed. Cir. 2000). Even where a prior art reference may be 
capable of being modified as claimed, there must be a suggestion or motivation in the reference 
to do so in order to establish obviousness. In re Mills, 916 F.2d 680, 682 (Fed. Cir. 1990); 
MPEP §2143.01. 

Here, there is no suggestion or motivation to modify Nishiyama to produce a method of
screening for a protein-protein interaction between at least one protein and a plurality of proteins
and where the plurality of proteins are screened concurrently. Even assuming arguendo that the
steps of Nishiyama could be duplicated for other TRX binding candidates (which Applicant
contests), this would involve the process of:

i) Cloning the binding candidate from the cDNA library; ii) Subcloning the coding region
from the binding candidate into a TNT expression vector; iii) Preparing and isolating the TRX-GST
fusion protein; iv) Treating the TRX-GST fusion protein with DTT, diamide, or hydrogen
peroxide; v) Preparing and isolating in vitro translated, 35S-labeled binding candidate; vi)
Washing the TRX-GST fusion protein; vii) Incubating the TRX-GST fusion protein with the
35S-labeled binding candidate; and viii) Analyzing the binding by SDS-PAGE analysis and
autoradiography.

As such, Nishiyama is not amenable to concurrent (e.g., high throughput) screening and 
the "mere duplication" of steps in Nishiyama cannot be used to obtain the claimed invention. 

Applicant notes that the motivation to modify references under 35 U.S.C. §103 must 
come from the prior art, itself. In re Mills, 916 F.2d at 682. Thus, it is essential: 

"to forget what... has been taught about the claimed invention and cast the mind 
back to the time the invention was made to occupy the mind of one skilled in the 
art who is presented only with the references and... the then-accepted wisdom in 
the art." In re Fine, 837 F.2d at 1075 (citing W.L. Gore, 721 F.2d at 1553); 
MPEP §2141.01. 

It is impermissible to use the claimed invention as an instruction manual or "template" to 
piece together the teachings of the cited references so that the claimed invention is alleged to be 
obvious. In re Fritch, 972 F.2d at 1266. 

In the instant case, Nishiyama does not teach or suggest all of the elements in the pending 
claims and there is no independent suggestion or motivation to modify or combine the reports of
Nishiyama to obtain the claimed invention. For at least these reasons, pending claims 1-3 cannot 
be considered unpatentable over Nishiyama and a prima facie case of obviousness has not been 
established. Withdrawal of this rejection is respectfully requested. 

CONCLUSION 

A favorable action on the merits is respectfully requested. If further discussion of this 
case is deemed helpful, the Examiner is encouraged to contact the undersigned at the telephone 
number provided below, and is assured of full cooperation in progressing the instant claims to 
allowance. 

Applicant believes no further fee is due at this time; however, the Commissioner is
authorized to charge any additional fees that may be due, or to credit any overpayment, to the 
undersigned's account, Deposit Account No. 50-0311. Reference No. 24862-503. Customer No.
35437. 



Date: July 18, 2005

Respectfully submitted,

Ivor R. Elrifi, Reg. No. 39,529 
Caryn DeHoratius, Reg. No. 45,881 
Attorneys for Applicant 
MINTZ, LEVIN, COHN, FERRIS,
GLOVSKY and POPEO, P.C.
666 Third Avenue, 24th Floor 
New York, New York 10017 
Telephone: (212)935-3000 
Telefax: (212)983-3115 

Protein biochips for differential profiling

Eric T Fung*, Vanitha Thulasiraman†, Scot R Weinberger‡
and Enrique A Dalmasso§



Progress has been made in utilizing ProteinChip® technology 
to profile and compare protein expression in normal and 
diseased states, particularly in the areas of cancer, infectious 
disease and toxicology. The past year has also seen the 
development of several novel chip types designed to analyze 
proteins in a fashion analogous to the array-based format of 
DNA microarrays. Some of these platforms may be used for 
differential profiling. 

Abbreviations

ACPm acyl carrier protein 

INH isoniazide 

LCM laser capture microdissection 

MALDI matrix-assisted laser desorption/ionization 

PSA prostate-specific antigen 

PSMA prostate-specific membrane antigen 

SELDI surface-enhanced laser desorption/ionization 

TOF time-of-flight 

Introduction 

Virtually all strategies for studying cellular function require 
the comparison of control states with perturbed states. At 
the heart of all such strategies is the requirement for an 
assay that sensitively and specifically measures differences 
in gene or protein expression or function. In the genomic 
and proteomic era, these assays no longer limit themselves 
to individual or small sets of genes or proteins; rather, they 
seek to examine the full complement of protein expression 
in a cell under a given set of physiological conditions. 
Because this represents a daunting task, protein differen- 
tial display techniques that compare protein profiles 
between control and experimental populations have 
become increasingly useful. In these systems, expression 
of proteins common to both groups is ignored and empha- 
sis is placed upon identifying and quantifying those 
proteins whose expression level is either upregulated or 
downregulated. These proteins become potential bio- 
markers representative of a given metabolic disease, drug 
reaction, neoplasm or microbial infection and are thus 
diagnostic or possible therapeutic targets. 
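
The differential display logic described above can be illustrated with a short sketch. It is not taken from the review or from any instrument software: the function name, the protein labels, the intensity values and the two-fold cutoff are all assumptions chosen for illustration. Proteins at similar levels in both groups are ignored, and only those whose level rises or falls beyond the cutoff are reported.

```python
from typing import Dict, List, Tuple


def differential_display(
    control: Dict[str, float],
    perturbed: Dict[str, float],
    min_fold: float = 2.0,  # assumed cutoff, not a value from the review
) -> List[Tuple[str, str, float]]:
    """Return (protein, direction, fold) for proteins that differ between groups."""
    hits: List[Tuple[str, str, float]] = []
    for protein in sorted(set(control) | set(perturbed)):
        c = control.get(protein, 0.0)
        p = perturbed.get(protein, 0.0)
        if c == 0.0 and p == 0.0:
            continue  # not detected in either group
        if c == 0.0:
            hits.append((protein, "up", float("inf")))  # seen only when perturbed
        elif p == 0.0:
            hits.append((protein, "down", float("inf")))  # seen only in control
        else:
            fold = p / c
            if fold >= min_fold:
                hits.append((protein, "up", fold))
            elif fold <= 1.0 / min_fold:
                hits.append((protein, "down", 1.0 / fold))
            # otherwise the protein is common to both groups and is ignored
    return hits


# Hypothetical intensities (arbitrary units) for four placeholder proteins.
control = {"A": 10.0, "B": 5.0, "C": 8.0}
perturbed = {"A": 10.5, "B": 20.0, "C": 2.0, "D": 3.0}
hits = differential_display(control, perturbed)
for protein, direction, fold in hits:
    print(protein, direction, fold)
```

In practice the inputs would be quantified peak intensities from matched control and experimental samples, and the cutoff would be set from replicate variability rather than fixed in advance.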

Chip-based protein differential display systems have, for several
reasons, been significantly more difficult to develop than
their counterparts that examine differential gene expression.
First, whereas the reverse-transcriptase polymerase chain 
reaction (RT-PCR) permits the amplification of mRNA, 
there is no analogous method to amplify protein expression. 
Second, for most functional assays (e.g. interaction studies), 
the proteins must be immobilized on the surface of the chip 
such that they retain their native conformation and also such 
that their active site(s) are exposed rather than buried. 
Additionally, the heterogeneity of the biophysical properties 
of proteins makes it virtually impossible for any single sur- 
face chemistry to bind the full complement of proteins 
present in a cell, particularly if one attempts to maintain the 
proteins in their active state. Finally, the capacity of the chip 
must be sufficient to allow as complete a representation of 
the proteome to be visualized as possible; abundant proteins 
(metabolic and cytoskeletal) overwhelm the detection of less 
abundant proteins such as signaling molecules and receptors, 
which are generally of more therapeutic interest. Strategies 
used to circumvent these problems are discussed by 
Weinberger [1]. 

In this review, we focus on a subset of chip-based assays 
that may be used to compare protein expression in normal 
and perturbed states. The most advanced commercially 
available system for this purpose is the ProteinChip® pro- 
teomics platform, and so most of the review will be devoted 
to describing applications of this system. The review will 
also describe briefly some of the more novel and unique 
chip-based assays that have been described in the litera- 
ture. Because of space limitations, we cannot cover all 
chip-based proteomics formats. Another commonly used 
chip technique, surface plasmon resonance (SPR), is 
described in greater detail by Scheller et al. in this issue
(pp 35-40) and by Rich and Myszka [2] in a recent issue of
this journal. We refer the reader to reviews by Figeys [3]
and Guetens et al. [4] for discussions of microfluidic protein
biochips, and peptide arrays are reviewed by Schneider- 
Mergener and colleagues in this issue (pp 59-64). 

ProteinChip® proteomics 

Perhaps the most established chip-based proteomics profil- 
ing platform is the incorporation of ProteinChip® technology 
with mass spectrometry, as commercially available from 
Ciphergen Biosystems, Inc. (reviewed in [5*]). At the heart of 
this technology are the ProteinChip® arrays, which have 
varying chromatographic properties, for example anion 
exchange, cation exchange, metal affinity and reverse phase. 
A complex mixture of proteins, as from cells or body fluids, 
can be reduced to sets of proteins with common properties 
by binding the sample to chips with differing surface 
chemistries in parallel and in series. After the chips are 
washed to remove unbound proteins, the bound proteins are
read in a time-of-flight mass spectrometer (TOF MS). The
resulting spectra give a multidimensional binding picture on
the basis of different types of interaction. This process,
known as surface-enhanced laser desorption/ionization
(SELDI), has several advantages over matrix-assisted laser
desorption/ionization (MALDI). As in MALDI, the sample
is admixed with a small acidic molecule (the matrix) that
crystallizes around the sample. In SELDI, however, the
ProteinChip® array acts as a surface to which the sample
binds uniformly and the matrix is placed on the chip only
after the proteins are bound to the chip. Consequently, the
spectra obtained are more uniform and reproducible as compared
with MALDI-obtained spectra. This difference
enables relative protein quantitation with SELDI that is not
possible with MALDI. Although SELDI has numerous
applications, we focus in this review on one specific application,
protein profiling, and refer the reader to the review by
Merchant and Weinberger [5*] for descriptions of some of its
other applications.

Figure 1

Protein profiling of Yersinia pestis cultured at its two physiological
temperatures. 10 µg of crude cytosolic extracts of Y. pestis grown at
26°C versus 37°C were analyzed on a strong anion exchange chip
(SAX-2). Proteins expressed only at 37°C, selected for purification
and identification, were the 14.9 kDa (boxed) and 78.8 kDa (inset)
proteins. The 14.9 kDa protein was identified as antigen 4 and the
78.8 kDa protein as the catalase/peroxidase KatY protein. Adapted
from [6] with permission.

Protein profiling or protein differential display studies have 
been widely used in the area of disease research, as com- 
parison of lysates from normal versus diseased cells can 
reveal the expression of important marker proteins. Lysates 
from disease and control samples are processed on the same 
types of chip surfaces and the chips are read under the same 
data collection conditions. Subsequent peak comparison 
allows the identification of differences. Once a peak of 
interest has been detected, the analyte can be enriched or 
purified for further analysis. This is possible by washing the 
chip with varying stringencies of pH, salt or organic solvent, 
depending on the type of chip surface chemistry. Once a
peak is sufficiently purified, on-chip digestion with proteolytic
enzymes followed by analysis of the peptide patterns
can yield important identification information. At the cur- 
rent stage of the technology, protein identification 
applications are still somewhat limited by the mass accura- 
cy of the ProteinChip® reader and it is therefore important 
to use careful calibration techniques with known peptides 
of comparable molecular weight to obtain reliable peptide 
maps. Even with a mass accuracy in the milliDalton range, 
the proteomic researcher is cautioned that the protein chest 
of nature contains enough redundancy and sequence con- 
servation to make wrong identification calls on the basis of 
peptide masses. The combination of profiling using 
ProteinChip® arrays with powerful fragmentation and
sequencing capabilities can enable the researcher to obtain 
high-resolution data and novel protein information directly 
from biological samples. 

As an example of profiling followed by protein identifica- 
tion in the realm of infectious disease, the ProteinChip® 
technology has been used to compare the pattern of pro- 
tein expression in two physiological states of Yersinia pestis, 
the causative agent of the plague. Because it exists in two 
carriers, the flea and the rodent, Y. pestis has evolved to
express different sets of proteins at the different tempera- 
tures encountered in the carrier organisms (26°C and 
37°C). Thulasiraman et al. [6] have used ProteinChip® 
technology to identify two proteins whose expression is 
regulated by temperature. Lysates of Y. pestis grown at the
two different temperatures were fractionated on strong 
anion exchange and immobilized metal-affinity chro- 
matography chips, and the protein expression profiles were 
compared (Figure 1). Two of the proteins seen to be upregulated
at 37°C were purified and identified as the
catalase/peroxidase KatY protein (78.8 kDa) and antigen 4 
(14.9 kDa) via mass spectrometric analysis of tryptic 
peptide fingerprints. 

ProteinChip® technology is also useful in linking gene- 
array expression data with protein discovery and, unlike its 
gene-based counterpart, can be used to examine post-translational
modification of proteins. This approach was
applied to verify upregulation of the acyl carrier protein
(ACPm) in Mycobacterium tuberculosis treated with isoniazide
(INH), an antifungal agent known to disrupt both
mycolic acid and cell wall synthesis. Transcription-array
studies of INH-treated M. tuberculosis indicated a 4.3-fold 
increase in ACPm mRNA levels after a six hour exposure 
to INH. Lysates of control and INH-treated M. tuberculosis 
cultures were examined using anion exchange 
ProteinChip® arrays and SELDI-TOF MS, followed by 
analysis of an on-chip protein digest of a 13,217 Da protein 
using a laser desorption/ionization triple quadrupole TOF 
MS capable of MS/MS operation [7]. This protein, which 
was determined to be ACPm conjugated with its cofactor 
(phosphopantetheine) and a C26 fatty acid, appeared to be
downregulated on the basis of ProteinChip® array data. 
However, closer examination of the ProteinChip® array 
data revealed that peaks found at 12,943.9, 12,997.7 and 
13,025.0 Da in the INH-exposed lysates were in fact 
ACPm-phosphopantetheine conjugated with C6, C10 and
C12 fatty acids, respectively. In this manner, the presence
of INH disrupted proper cell wall synthesis, reducing the
abundance of the ACPm-C26 conjugate while elevating
levels of the smaller fatty acid varieties to a point where 
total ACPm abundance was upregulated when compared 
with the control group. 

Cancer research in particular has embraced platforms that 
enable differential protein profiling, as the identification of 
upregulated or downregulated proteins suggests the pres- 
ence of a tumor marker that might be used for diagnosis, 
prognosis, monitoring of disease progression or therapeutic
success and/or as a therapeutic target. Cancer specimens 
can be compared with normal specimens, or cancerous 
regions of a biopsy or specimen can be directly compared 
with non-cancerous regions of the same specimen. Specific 
regions of a specimen can be selected for analysis using a 
technique called laser capture microdissection (LCM) [8]. 
Petricoin and colleagues [9] have used LCM in conjunc- 
tion with ProteinChip® proteomics to study protein 
expression profiles in patient-matched normal colon, can- 
cerous and metastatic samples with the generation of 
distinct protein profiles for each group. In particular, the 
authors used this approach to study normal prostate, pro- 
static intraepithelial neoplasia (widely considered to be a 
precursor lesion to invasive carcinoma) and prostate cancer, 
and they showed that the cancer specimens demonstrated 
downregulation of a 28 kDa protein. A similar study was 
performed on head and neck cancers [10]. 



Although SELDI studies are routinely performed on tissue 
samples, isolation of markers from body fluids, particularly 
serum or urine, may prove more valuable for diagnostic 
purposes. Wright et al. [11] have taken the approach of 
studying both tissue and body fluids in the search for bio- 
markers of prostate cancer. In this study, the authors 
showed that ProteinChip® proteomics could be used to 
identify known prostate cancer-associated biomarkers 
(prostate-specific antigen [PSA], prostatic acid phosphatase,
prostate-specific membrane antigen [PSMA] and
prostate-specific peptide) in cell lysates of LCM-captured 
prostatic cells and body fluids from prostate cancer 
patients, as well as to discover several potential biomark- 
ers, including a 33 kDa and an 18 kDa protein found to be 
upregulated in prostate cancer cells. When the authors pre- 
fractionated the samples by size exclusion chromatography 
followed by ionic exchange chromatography before bind- 
ing to the ProteinChip® array, they could detect over 300 
protein peaks. On the basis of this approach, over 30 pro- 
teins were detected as being either over-expressed or 
under-expressed in specimens from prostate cancer 
patients [12]. Although no single protein was found to dis- 
tinguish prostate cancer from the non-cancer groups, a 
combination of multiple protein peaks was capable of dis- 
criminating prostate cancer from a normal age-matched 
healthy male population. These studies were performed 
on a variety of body fluids, including serum, urine and 
seminal plasma. 

The ProteinChip® platform used to identify biomarkers can 
also be used to develop a rapid, sensitive and high-through- 
put multi-marker assay. The premise of this approach is first 
to establish composite fingerprint profiles of both disease 
and non-disease states from a series of training samples, and 
then to use these composite profiles to make a diagnosis on 
actual unknown patient samples. In this case it is not essen- 
tial to identify the proteins to make a diagnosis. Moreover, 
by utilizing a group of biomarkers one is not constrained by 
the sensitivity and specificity of any single biomarker, which 
may be relatively low. For example, by evaluating multiple 
urinary proteins by SELDI, the detection rate for low 
stage/grade bladder cancer increased to greater than 75%, 
compared with 30% by conventional urine cytology [13]. 
Another potential clinical application of the SELDI system 
is to develop immunoassays by immobilizing antibody to a 
specific biomarker on the ProteinChip® array surface. 
Wright and co-workers [14,15] have successfully used this 
approach to measure free PSA and complexed PSA, and 
PSMA in serum and seminal plasma. Interestingly, this plat- 
form was successful for quantitation of PSMA in serum 
where other immunoassay formats had failed. Also of inter- 
est was the detection of other possible PSMA isoforms that 
appear to be differentially expressed in normal, benign and 
malignant prostate samples, and that would not have been 
detected with other assay formats. By adding a variety of 
antibodies to a single array, it is possible to develop a multi- 
plex assay to measure simultaneously eight or more 
biomarkers using a single ProteinChip® array. 
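
The composite-fingerprint premise described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the ProteinChip® software: the peak intensities, the group labels and the nearest-centroid rule are all hypothetical choices made for clarity.

```python
from math import dist  # Euclidean distance, available from Python 3.8

def build_profiles(training):
    """Average the peak-intensity vectors of each group into one composite profile."""
    profiles = {}
    for label, spectra in training.items():
        n_peaks = len(spectra[0])
        profiles[label] = [sum(s[i] for s in spectra) / len(spectra)
                           for i in range(n_peaks)]
    return profiles

def classify(sample, profiles):
    """Assign an unknown sample to the group whose composite profile is closest."""
    return min(profiles, key=lambda label: dist(sample, profiles[label]))

# Hypothetical training samples: intensities of three protein peaks per sample.
training = {
    "disease":     [[0.9, 0.2, 0.7], [0.8, 0.3, 0.6]],
    "non-disease": [[0.2, 0.8, 0.1], [0.3, 0.7, 0.2]],
}
profiles = build_profiles(training)

# An unknown patient sample is classified without identifying any protein.
unknown = [0.7, 0.3, 0.5]
diagnosis = classify(unknown, profiles)
```

Note that the classifier never needs to know which proteins produce the peaks, which mirrors the point that protein identification is not essential for diagnosis.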

New chip platforms for protein expression 
analysis 

In recent months, several important papers have described 
the development of novel chip techniques, all of which 
employ the same concept as DNA arrays — to place thou- 
sands of addressable spots in a small grid. Emili and 
Cagney [16] propose segregating such arrays into nonliving 
and living arrays. Nonliving arrays consist of grids of pro- 
teins or peptides and are assayed biochemically, whereas 
living arrays consist of grids of organisms, typically bacteria 
or yeast, and are assayed metabolically. The first large- 
scale array of proteins was developed as a 'living array' to 
study two-hybrid interactions — yeast colonies expressing 
specific combinations of proteins were spotted in known 
positions and their phenotype (growth on selective media) 
was assayed [17]. Such a platform could be adapted for pro- 
filing analysis by constructing numerous types of selective 
media and replica plating grids of yeast onto each sub- 
strate. Extension of such a principle to the study of 
mammalian cells (e.g. tumor cell lines) will prove more 
technically challenging. 

Another example of a living array is the antibody array 
recently described by de Wildt et al. [18**]. The arrays cre- 
ated by these authors contain a collection of 18,342 bacterial 
clones, each expressing a different single-chain antibody. 
These are spotted onto a 22 x 22 cm filter in duplicate, or 
duplicate filters might be used. It may be possible to use 
this technique in the future to randomly generate antibod- 
ies, to create arrays of these randomly generated antibodies, 
to incubate lysates of relevant tissue or even body fluids 
with the filters and to identify upregulated or downregulat- 
ed proteins. The proteins of interest can then be enriched 
and identified using, for example, MALDI-MS. 

Recently, MacBeath and Schreiber [19*] have described a 
novel protein microarray that utilizes glass slides coated 
with an aldehyde-containing silane reagent. The aldehy- 
des react with primary amines (contained primarily in 
lysines and at the amino terminus) exposed on the pro- 
tein's surface, leading to the binding of proteins in many 
different conformations on the chip. In a small-scale test of 
this chip, protein-protein interaction studies were per- 
formed — baits were printed onto the slide and probed 
with fluorescently labeled partners. In a larger-scale test, a 
slide was printed containing 10,799 spots of protein G and 
one spot of specific bait (FKBP12-rapamycin-binding pro- 
tein, FRP) and was probed with a mixture of 
BODIPY-FL-IgG and Cy5-FKBP. All of the protein 
G spots were bound by the BODIPY reagent whereas the 
single spot of FRP was detected by the FKBP. The chal- 
lenge is to extend this system to permit the analysis of 
large sets of unique proteins. 

A distinct type of protein array that has been utilized for 
differential protein expression analysis is the tissue 
microarray [20]. This type of array consists of cylinders of 
tissue 0.6 mm in diameter punched out from paraffin 
blocks. Grids of these cylinders are constructed by aligning 
cylinders from a number of samples (e.g. several thousand) 
into a recipient paraffin block, and serial sections are cut. 
Each section can then be analyzed by fluorescent in situ 
hybridization or immunohistochemistry. Although this 
technique obviously relies on choosing specific antigens 
for visualization, it does allow the concurrent examination 
of potentially thousands of specimens across a gradient of 
severity (e.g. benign, in situ carcinoma, invasive carcinoma 
and metastatic carcinoma). This technique has been used 
to examine potential tumor markers in bladder cancer, 
renal cell carcinoma and prostate cancer [21-23] and 
combined with cDNA microarray analysis to study 
glioblastomas [24]. 

Conclusions 

The ProteinChip® proteomics platform provides an out- 
standing system for chip-based analysis of differential 
protein expression, given its low sample requirements and 
high-throughput format. Moreover, once a specific bio- 
marker is found, the same platform may be used for 
diagnosis. This is particularly useful if a panel of biomark- 
ers is more sensitive or specific than a single biomarker. 
Because this platform does not rely on protein conforma- 
tion for detection, it has a great advantage over new chip 
platforms designed to bind recombinant proteins in an 
array-based manner analogous to DNA microarrays. In their 
current state, these systems are constrained by a number of 
requirements: the identity of the protein placed at each 
position must be known, the recombinant protein must be 
soluble, the protein must be folded such that an appropri- 
ate domain for detection is displayed on the chip surface 
and a detection method must be either available or devel- 
oped. To be useful for general protein profiling, future 
generations of these techniques will require adaptability in 
design and detection. It is likely that the next iteration of 
microarray-based protein chips will be best suited for the 
study of specific families of proteins, as family members 
share a common domain to which a detecting moiety can be 
synthesized. Enzymes make particularly good profiling tar- 
gets, as the detection of their activity is a natural assay (as 
an example, see Update). Future generations of these tech- 
niques will likely rely on the expression of fusion proteins 
that incorporate one or more domains that can be used 
simultaneously to increase the solubility of the protein, ori- 
ent the protein on the chip surface and act as a detection 
substrate (e.g. change in fluorescence wavelength when a 
ligand is bound). Molecular breeding will almost certainly 
play a role in the development of such domains, while 
advances in surface chemistry will be needed to increase 
the capacity of chip surfaces. Finally, improvements in the 
sensitivity and sequencing capabilities of mass spectrome- 
try instrumentation will enable identification of the 
proteins that are upregulated or downregulated. 

Update 

Recently, a chip designed to assay kinase activity was 
described [25*]. Because the entire yeast genome has been 
sequenced, these researchers could clone and express 
glutathione S-transferase (GST) fusions of almost all the yeast 
kinases. The kinase-assay chip was constructed by first 
creating a mold over which a liquid silicone elastomer was 
poured. This was allowed to harden, was peeled away and 
was then placed on a glass slide. This array, which mea- 
sured 28 mm x 14 mm, contained wells 1.4 mm in 
diameter and 300 µm deep. The potential substrates were 
covalently attached to the walls of the wells and incubated 
with recombinant kinase in the presence of radiolabeled 
ATP. After the kinase reaction was completed, the wells 
were washed and the chip exposed to either X-ray film or 
a phosphoimager. In this manner, each kinase could be 
tested against each of the different substrates. It is easy to 
envision modifications of this system to study other enzy- 
matic activities such as GTP cleavage or protein lipidation, 
as well as to serve as a platform for studying the effects of 
drug candidates on such enzymatic activities. 

Towards a truly integrative biology through the functional 
genomics of yeast 

Daniela Delneri, Francesco L Brancia and Stephen G Oliver 



A complete library of mutant Saccharomyces cerevisiae 
strains, each deleted for a single representative of yeast's 
6000 protein-encoding genes, has been constructed. This 
represents a major biological resource for the study of 
eukaryotic functional genomics. However, yeast is also being 
used as a test-bed for the development of functional genomic 
technologies at all levels of analysis, including the 
transcriptome, proteome and metabolome. 

Abbreviations 

2DGE two-dimensional gel electrophoresis 
ICATs isotope-coded affinity tags 
MS mass spectrometry 
ORF open reading frame 

Introduction 

The availability of complete genome sequences presents 
the opportunity of adopting a systems-based approach to 
biology that will allow the determination of how all the 
genes in a genome act and interact to produce a function- 
ing organism. Such an approach demands that technologies 
are developed that allow analyses at the level of mRNAs 
(transcriptome), proteins (proteome) and low molecular 
weight intermediates (metabolome) to be carried out in as 
comprehensive a manner as possible. Moreover, high- 
throughput methods of generating defined mutants and 
assessing their phenotypes also are required. The full 
range of genomic technologies is currently being applied to 
a small number of model organisms. Of these, the yeast 
Saccharomyces cerevisiae is in the vanguard. 

The three levels of functional genomic analysis are both qual- 
itatively and quantitatively distinct. Messenger RNA 
molecules, the subject of transcriptome analysis, may be stud- 
ied in a fully comprehensive manner using the massively 
parallel technique of hybridisation-array analysis. However, 
mRNA molecules are not functional entities within the cell, 
but simply transmitters of the instructions for synthesising 
proteins, and so transcriptome analyses only approach func- 
tionality in an indirect manner. While both proteins and 
metabolites represent true functional entities within cells, the 
analysis of the proteome is fraught with technical difficulties, 
and most techniques in current use are neither comprehen- 
sive in their scope, nor massively parallel in their execution. 
The main difficulty with metabolome analysis is conceptual, 
rather than technical. While it is technically feasible to 
analyse several hundred metabolites simultaneously, the 
relationship between the metabolome and the genome dif- 
fers from that of the transcriptome or proteome in that it is 
indirect. Many genes may be involved in the biosynthesis or 
degradation of a single metabolite. Thus bioinformatic tools 
are required that permit prior knowledge of the metabolic 
impact of known genes to be exploited in elucidating the 
function of novel genes. 

It is bioinformatics that holds the key to functional 
genomics, since it is clear that investigations at all the lev- 
els of 'omic' analysis, and also phenotypic studies, will be 
required in taking this integrative approach to biology. 
Thus, massive amounts of data, of qualitatively and quan- 
titatively distinct types, must be integrated, compared and 
used to construct heuristic models of living systems. In this 
short review, we will attempt to look at all of the levels of 
genomic analysis and discuss the prospects for their use in 
gaining an integrative view of the workings of S. cerevisiae. 

Genome 

A major conceptual and practical problem for the system- 
atic analysis of gene function in eukaryotes is that of 
genetic redundancy. Gene duplications could have 
occurred through a series of local events or by complete 
genome duplication [1,2]. The high level of redundancy, 
generated by a whole-genome duplication, is thought to 
have been reduced via deletions and chromosomal 
rearrangements, while sequence divergence and selection 
allowed the acquisition of new functions. Thus, much of 
the redundancy in the yeast genome may be more appar- 
ent than real, with identical, or almost identical, gene 
products fulfilling distinct physiological roles due to differ- 
ential gene expression or the targeting of similar proteins 
to different cellular compartments. For example, Delneri 
et al. [3] showed that only one fully functional member of 
the aryl-alcohol dehydrogenase (AAD) gene family in 
S. cerevisiae responds to oxidative stress. Moreover, theo- 
retical studies have indicated that the genetic robustness 
of this organism does not rely on gene redundancy [4*]. 

Considerable progress has been made in the analysis of yeast 
gene function using single open reading frame (ORF) dele- 
tion mutants. However, it must be appreciated that the 
failure of any given mutant to reveal a phenotype may be the 
consequence of either genetic redundancy or of effective 
homeostatic controls within the cell. Thus, more studies are 
required in which entire gene families are deleted [5,6] or 
where phenotype is assessed quantitatively, rather than qual- 
itatively [7,8]. The year 2000 saw the establishment of a 
collection of mutant yeast strains, each bearing a defined 
deletion in one of yeast's 6138 potential protein-encoding 
genes (for details, contact euroscarf@em.uni-frankfurt.de). 
Each deleted ORF is flanked by two 20 bp molecular 
barcodes that are unique for each deletion [9**]. These allow the 
parallel analysis of the phenotypes of a large number of dele- 
tants to be performed using competitive growth assays. Such 
analyses assume that the specific ORF replacement is the 
only genetic change in the deletant. However, the transfor- 
mation is both mutagenic and recombinogenic, and 
competitive hybridisation-array studies have shown that 
deletants may be aneuploid for whole chromosomes or 
chromosomal segments [10]. 
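
The logic of a barcode-based competitive growth assay can be sketched as follows. This is a minimal illustration, assuming that each barcode yields one hybridisation signal per time point; the strain names, the signal values and the use of a log2 change in pool fraction as the score are hypothetical and are not taken from the cited studies.

```python
from math import log2

def relative_abundance(signals):
    """Convert raw barcode signals at one time point into fractions of the pool."""
    total = sum(signals.values())
    return {strain: value / total for strain, value in signals.items()}

def log2_enrichment(start, end):
    """log2 change in pool fraction for each strain between two time points."""
    f0, f1 = relative_abundance(start), relative_abundance(end)
    return {strain: log2(f1[strain] / f0[strain]) for strain in start}

# Hypothetical barcode hybridisation signals (arbitrary units) for three deletants.
t0 = {"orfA_del": 100.0, "orfB_del": 100.0, "orfC_del": 100.0}
t1 = {"orfA_del": 200.0, "orfB_del": 150.0, "orfC_del": 50.0}

scores = log2_enrichment(t0, t1)

# Strains with a negative score lost ground in the pooled culture.
depleted = sorted(strain for strain, score in scores.items() if score < 0)
```

A score of this kind is only meaningful if the ORF replacement is the sole genetic difference between strains, which is exactly the assumption that the aneuploidy findings call into question.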

Ross-Macdonald et al. [11*] have taken a different approach 
to the generation of a large collection of defined yeast 
mutants. They used a transposon to mutagenise a yeast 
clone library in Escherichia coli. Individual plasmids were 
then prepared and used to transform a diploid yeast strain, 
where each plasmid integrates at its corresponding chromo- 
somal locus, replacing the endogenous copy of the gene. 
The structure of the transposon allowed them to insert a 
short haemagglutinin tag within the yeast ORF that may be 
used for immunolocalisation of the tagged proteins. This 
permitted more than 300 previously nonannotated ORFs to 
be identified and the localisation of their protein products 
to be determined. This collection of mutants has also been 
used to determine disruption phenotypes for about 8000 
strains, using 20 different growth conditions. 

Genome-wide expression analysis (see below) has been used 
to follow adaptive evolution in yeast. The global transcript 
profiles from mutants selected during aerobic growth in a glu- 
cose-limited chemostat were compared with that of their 
parental strain. Genes involved in glycolysis and the tricar- 
boxylic acid cycle showed alterations in expression in all three 
independently evolved strains, indicating that increased fit- 
ness is acquired by reducing the percentage of glucose which 
is fermented and increasing that channelled to respiration [12]. 

Transcriptome 

Hybridisation arrays are now used widely to study the effects 
of cell physiology, developmental biology, or genetic constitution 
on the global expression pattern of yeast. For instance, 
yeast genes regulated directly or indirectly by the transcriptional 
activator Yap1p have been identified. The 
Yap1p-binding site was not always found in the promoter 
region of the target genes, and genes lacking the site are presumably 
under the indirect control of the activator. As ever, it is important 
to carefully define and regulate cell physiology when 
carrying out these global analyses. Thus, the RPI1 gene 
(which encodes a repressor of the Ras-cAMP pathway) was 
found to be downregulated by Yap1p during the exponential 
growth phase, but upregulated in the stationary phase or 
following oxidative stress [13]. Careful control of cell physiology 
was employed by ter Linde et al. [14] in their investigation of 
adaptation to aerobiosis and anaerobiosis in S. cerevisiae. About 
93% of the ORFs analysed were expressed during both aero- 
bic and anaerobic conditions, but about 140 and 219 genes 
showed a threefold higher transcription level under anaerobic 
and aerobic conditions, respectively. 
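
A fold-change comparison of this kind can be sketched as a simple filter. The sketch below assumes one expression value per ORF in each condition; the ORF names and values are hypothetical, and treating "threefold higher" as a ratio of at least 3 is an assumption rather than the published criterion.

```python
def threefold_changes(aerobic, anaerobic, threshold=3.0):
    """Return the ORFs at least `threshold`-fold higher in each condition."""
    higher_aerobic, higher_anaerobic = [], []
    for orf in aerobic:
        a, an = aerobic[orf], anaerobic[orf]
        if a <= 0 or an <= 0:
            continue  # skip ORFs not detected in both conditions
        if a / an >= threshold:
            higher_aerobic.append(orf)
        elif an / a >= threshold:
            higher_anaerobic.append(orf)
    return higher_aerobic, higher_anaerobic

# Hypothetical expression levels (arbitrary units).
aerobic = {"ORF1": 90, "ORF2": 10, "ORF3": 50, "ORF4": 0}
anaerobic = {"ORF1": 30, "ORF2": 40, "ORF3": 45, "ORF4": 20}

up_aerobic, up_anaerobic = threefold_changes(aerobic, anaerobic)
```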

Hughes et al. [15*] have recently constructed a 'compendium' 
of expression profiles corresponding to 300 diverse 
mutations or chemical treatments. In this way, the authors 
were able to assign functions to uncharacterised ORFs 
by determining, via pattern matching, the biochemical and 
cellular pathways affected. The compendium was also used 
to identify a novel target of the drug dyclonine, a 
sodium-channel blocker. 

Proteomics 

Although the yeast proteome (as it came to be known) has 
been studied using two-dimensional gel electrophoresis 
(2DGE) since the late 1970s, progress in identifying the 
proteins contained within the spots on such gels has been 
disappointingly slow [16]. The availability of the complete 
yeast genome sequence and the development of 'soft' ion- 
isation techniques for mass spectrometry (MS) have done 
much to speed up spot assignments, but improvements in 
MS and bioinformatic techniques are still required. Recent 
advances include a method for de novo peptide sequencing 
[17] that improves fragmentation efficiency in post-source 
decay experiments, the use of guanidination to improve 
the signal response of carboxy-terminal lysine peptides 
[18], and the combination of the latter with Edman-type 
derivatisation [19]. This double-derivatisation approach 
offers a dual advantage, as both the total number of pep- 
tide masses available for database searching is increased, 
and the search space is reduced due to the identification of 
the carboxy- and amino-terminal amino acids [20]. 

Yates Jr and co-workers [21] have developed a method for pro- 
tein separation that is an attractive alternative to 2DGE. 
Known by the acronym MudPIT (multidimensional protein 
identification technique), it preserves protein-protein interac- 
tions and so facilitates the analysis of multisubunit complexes. 
MudPIT combines reversed-phase liquid chromatography 
with either cation-exchange or size-exclusion chromatography 
prior to analysis with electrospray tandem MS. This method 
was used to dissect the yeast 80S ribosome and 
resulted in the identification of an additional 11 proteins that had 
not previously been detected using the 2DGE approach. 

An alternative approach to the identification of 
protein-protein interactions was applied in the analysis of 
the yeast nuclear pore complex (Nup85p) [22]. The 
complex was isolated using an affinity tag, crosslinked, and 
the resulting protein pairs were resolved by sodium 
dodecyl sulphate-polyacrylamide gel electrophoresis and 
identified via matrix-assisted laser desorption/ionisation 
(MALDI)-MS, providing a model of the spatial organisa- 
tion of the complex. 

Gygi et al. [23**] have described a method that combines sequence 
identification of the components in complex mixtures with accurate 
relative quantification. It exploits a novel class of chemical reagents, 
known as isotope-coded affinity tags (ICATs), together with 
tandem MS. The ICAT reagents contain both a biotin moiety 
and a thiol-reactive group, which becomes covalently attached to each 
cysteinyl residue in every protein. These two moieties are 
joined by a linker that contains either normal hydrogen or its 
heavy isotope, deuterium. Two protein extracts, obtained 
from yeast cultures grown on either ethanol or galactose, 
were compared by treating them with the isotopically light 
and heavy ICAT reagents, respectively. 
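
The relative quantification step can be sketched as a search for pairs of peaks separated by the mass difference between the heavy and light tags. This is a minimal sketch, not the published data-processing pipeline: the peak list is hypothetical, each peptide is assumed to carry a single tag, and the 8 Da shift and 0.05 Da tolerance are assumptions supplied as parameters.

```python
def pair_icat_peaks(peaks, shift=8.0, tol=0.05):
    """Pair light and heavy peaks separated by `shift` Da.

    peaks: list of (mass, intensity) tuples for singly tagged peptides.
    Returns a list of (light_mass, heavy/light intensity ratio) tuples.
    """
    pairs = []
    for light_mass, light_int in peaks:
        for heavy_mass, heavy_int in peaks:
            if abs((heavy_mass - light_mass) - shift) <= tol:
                pairs.append((light_mass, heavy_int / light_int))
    return pairs

# Hypothetical peak list: two light/heavy pairs and one unpaired peak.
peaks = [
    (1000.50, 200.0), (1008.50, 100.0),
    (1500.75, 50.0),  (1508.75, 150.0),
    (1800.00, 80.0),
]

ratios = pair_icat_peaks(peaks)
```

With the light reagent applied to the ethanol extract and the heavy reagent to the galactose extract, a ratio above 1 would indicate a peptide that is more abundant in the galactose-grown culture.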

A comprehensive analysis of protein-protein interaction in 
yeast has been undertaken recently by Uetz et al. [24*]. 
They employed two complementary strategies to exploit 
the yeast two-hybrid system for high-throughput analyses. 
The array approach (using pairwise crosses) yields more 
positive interactions, while the high-throughput library 
approach is more comprehensive, but less productive. Of 
the 12 'baits' that gave positive interactions with both 
screens, 48 possible partners were identified by the array 
approach, against only 14 in the library screen. 

The relationship between mRNA and protein expression 
levels was investigated recently in a genome-wide context. 
Two groups of scientists made independent analyses com- 
paring, under a given set of physiological conditions, the 
amount of proteins from a two-dimensional protein gel with 
the corresponding amount of transcripts calculated from the 
published serial analysis of gene expression (SAGE) analysis. 
Futcher et al. [25] found a satisfactory correlation between 
mRNA abundance, protein abundance and codon bias (mea- 
sured in glucose and ethanol media). On the other hand, 
Gygi et al. [26] found that, for some genes, the mRNA 
abundance and the corresponding protein levels varied by more 
than 20-fold. According to this second study, the yeast 
proteome could not yet be predicted simply from 
transcript levels because of the limits of the current 
approaches for quantitative analysis of protein levels. 
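
The two kinds of comparison discussed above, an overall correlation and a per-gene fold discrepancy, can be sketched as follows. The abundance values are hypothetical, and the choice of a Pearson correlation on log-transformed values is an assumption made for illustration, not necessarily the statistic used in either study.

```python
from math import log10, sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def fold_spread(mrna, protein):
    """Largest divided by smallest protein-per-transcript ratio across genes."""
    per_transcript = [p / m for m, p in zip(mrna, protein)]
    return max(per_transcript) / min(per_transcript)

# Hypothetical data set 1: protein tracks mRNA over three orders of magnitude.
mrna_a = [1.0, 10.0, 100.0, 1000.0]
protein_a = [2.0, 20.0, 200.0, 2000.0]
r = pearson([log10(v) for v in mrna_a], [log10(v) for v in protein_a])

# Hypothetical data set 2: equal mRNA levels, very different protein levels.
mrna_b = [10.0, 10.0, 10.0]
protein_b = [5.0, 50.0, 100.0]
spread = fold_spread(mrna_b, protein_b)
```

The two data sets illustrate how a strong global correlation and large gene-by-gene discrepancies are not mutually exclusive findings.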

Metabolomics 

There are two major approaches to the assignment of gene 
function via metabolic analyses. One, which provides a direct 
link to the genome, is to uncover the biochemical reactions 
catalysed by enzymes encoded by genes of unknown function. 
Such an approach has been adopted by Martzen and co-work- 
ers: they developed a genomic strategy to identify yeast genes 
specifying biochemical activities by constructing a library of 
plasmids expressing glutathione S-transferase-tagged yeast 
proteins. Using this strategy, they were able to identify proteins 
with novel biochemical activities [27*]. The problem with such an 
approach, as with the assignment of function via sequence 
homology, is that it attributes mechanism, rather than biologi- 
cal function, and is completely context-independent. 

An alternative approach is to study the change in the cell's 
metabolic profile (or metabolome) which results from the 
deletion or overexpression of a given gene, and to assign 
function by comparing the metabolome changes that result 
from the deletion of unknown genes with those that occur 
due to similar manipulations of known genes. This approach 
has been applied to S. cerevisiae and has been termed 
FANCY (for functional analysis by co-responses in yeast) 
[28*]. It involves a comprehensive analysis of cellular 
metabolites using either MS or nuclear magnetic resonance 
spectroscopy, combined with sophisticated chemometric 
analysis of the resulting spectra. The approach is able to 
reveal a phenotype for gene deletions that have no measur- 
able effect on cell growth and cluster together metabolome 
profiles resulting from the deletion of genes affecting 
similar domains of metabolism. Yeast has rather a limited range 
of metabolites, but a similar approach has been successfully 
employed with the plant, Arabidopsis thaliana [29*], which 
produces a much more complex range of metabolites. 
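
The comparison at the heart of this approach, matching the metabolome change of an unknown deletion against those of known deletions, can be sketched as below. Cosine similarity is used here as a simple stand-in for the chemometric analysis described in the text; the strain labels and metabolite change values are hypothetical.

```python
from math import sqrt

def cosine(u, v):
    """Cosine similarity between two metabolite-change vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def closest_known(unknown, known):
    """Return the known deletion whose metabolome change is most similar."""
    return max(known, key=lambda gene: cosine(unknown, known[gene]))

# Hypothetical changes in four metabolites relative to the parental strain.
known = {
    "glycolysis_gene_del": [1.0, -0.8, 0.1, 0.0],
    "lipid_gene_del":      [0.0, 0.1, -0.9, 1.2],
}
unknown = [0.9, -0.7, 0.0, 0.1]

match = closest_known(unknown, known)
```

A match of this kind suggests only that the unknown gene affects a similar domain of metabolism; it does not by itself establish the biochemical reaction involved.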

Bioinformatics 

Thousands of data points are accumulated in a single 
genome-wide expression experiment. In this context, the 
role of bioinformatics becomes essential in order to make 
biological sense out of the data and to assign functions to 
uncharacterised coding regions. An obvious approach to 
assigning gene function from transcriptome data is to 
group together genes with similar expression profiles [30]. 
Such methods analyse patterns of gene activity in an 
'unsupervised' fashion, that is, without recourse to a 
training set of data relating to genes of well-known function 
and regulatory pattern [31]. Recently, Brown et al. [32] 
introduced a method for the identification of functionally 
related yeast proteins based on the theory of support vec- 
tor machines, which represent a supervised learning 
technique that exploits prior knowledge of gene function 
to identify clusters. An alternate approach that exploits 
genetic programming has been published recently and 
appeared to be even more successful in that it proved able 
to learn the class of helix-turn-helix proteins, which 
include the transcription factors [33]. 

A different approach to the cluster analysis has been devel- 
oped by Marcotte et al. [34**]. They grouped proteins by 
'experimental data', 'related metabolic function', 'related phy- 
logenetic profiles', 'rosetta stone method' (which links 
individual proteins whose homologues, in other organisms, are 
combined into a single multifunctional complex), and 'corre- 
lated mRNA expression'. Using these methods, they found a 
total of 93,000 pairwise links between functionally related 
yeast proteins, allowing the assignment of a general function to 
more than half the uncharacterised ORFs. Such an approach is 
an excellent example of the benefits of integrating data from 
all levels of functional genomic analysis. These integrative 
approaches require database structures that have sufficient 
breadth and flexibility to allow complex queries to be made 
over the qualitatively and quantitatively different datasets 
represented by the 'omes'. An appropriate object data model has 
been constructed recently that permits the integration of 
genome, transcriptome and proteome data for yeast [35]. 

Conclusions 

The fact that the genome sequences of even well-charac- 
terised organisms contained about 40% of genes whose 
function had neither been established nor predicted was a 
shock to many biologists. The challenge was to assign a func- 
tion to each of these novel genes. However, the 
comprehensive methods of analysis that are used to pursue 
these assignments have revealed that our view of biological 
function is rather one-dimensional. The hope now is that 
the analysis of genome, transcriptome, proteome and 
metabolome, as well as the phenotype, will allow a much more 
integrative view of biology at the level of the whole organism. 
In the early stages of such a process, single-celled organisms 
offer the advantages of simplicity combined with genetic and 
physiological malleability. The experimental methods and 
theoretical framework established using an organism such as 
yeast should provide a firm foundation for an integrative biol- 
ogy of human beings and their domestic plants and animals. 

To facilitate studies of the yeast proteome, we cloned 5800 open reading 
frames and overexpressed and purified their corresponding proteins. The 
proteins were printed onto slides at high spatial density to form a yeast 
proteome microarray and screened for their ability to interact with proteins 
and phospholipids. We identified many new calmodulin- and phospholipid- 
interacting proteins; a common potential binding motif was identified for 
many of the calmodulin-binding proteins. Thus, microarrays of an entire 
eukaryotic proteome can be prepared and screened for diverse biochemical 
activities. The microarrays can also be used to screen protein-drug 
interactions and to detect posttranslational modifications. 



A daunting task after a genome has been fully sequenced is to understand 
the functions, modification, and regulation of every encoded protein (1). 
Currently, much effort is devoted toward studying gene, and hence protein, 
function and regulation by analyzing mRNA expression profiles, gene 
disruption phenotypes, two-hybrid interactions, and protein subcellular 
localization (2). Although these studies are useful, much information about 
protein function can be derived from the analysis of biochemical activities 
(3-7). In principle, the biochemical activities of proteins can be 
systematically probed by producing proteins in a high-throughput fashion 
and analyzing the functions of hundreds or thousands of protein samples in 
parallel using protein microarrays (5, 6, 8). Major hurdles in screening an 
entire proteome array have been the ability to generate the necessary 
expression clones and also the expression and purification of proteins in a 
high-throughput fashion. 

We have constructed a yeast proteome microarray containing approximately 
80% of yeast proteins and screened it for a number of biochemical 
activities. We first built a high-quality collection of 5800 yeast open 
reading frames (ORFs) (93.5% of the total) cloned into a yeast high-copy 
expression vector using recombination cloning (9). The yeast proteins are 
fused to glutathione S-transferase polyhistidine (GST-HisX6) at their 
NH2-termini and expressed in yeast using the inducible GAL1 promoter 
(5, 9). The yeast expression strains contain individual plasmids in which 
the correct yeast ORFs have been shown to be fused in-frame to GST by DNA 
sequencing. The proteins were expressed in yeast to help ensure that the 
proteins were modified and folded properly. Using a 96-well format, 1152 
samples were purified at once from yeast extracts using 
glutathione-agarose beads (10). We included 0.1% Triton in the lysis buffer 
and washes to ensure that the purified proteins were free of lipids. The 
quality and quantity of the purified proteins were monitored using 
immunoblot analysis of 60 random samples (Fig. 1A). Greater than 80% of the 
strains produced detectable amounts of fusion proteins of the expected 
molecular weight. 

To prepare the proteome chips, we printed 6566 protein preparations, 
representing 5800 different yeast proteins, in duplicate onto glass slides 
using a commercially available microarrayer. Our initial experiments used 
aldehyde-treated microscope slides (6) in which fusion proteins attach to 
the surface through primary amines at their NH2-termini or other residues 
of the protein. In subsequent experiments, we spotted proteins onto 
nickel-coated slides, in which the fusion proteins attach through their 
HisX6 tags and presumably uniformly orient away from the surface. Although 
both slides were successful, the nickel-coated slides gave superior signals 
for our particular protein preparations (Fig. 1B). 

Fig. 1. GST::yeast proteins were purified in a 96-well format. (A) Sixty 
samples were examined by immunoblot analysis using anti-GST; 19 
representative examples are shown. Greater than 80% of the preparations 
produce high yields of fusion protein. (B) 6566 protein samples 
representing 5800 unique proteins were spotted in duplicate on a single 
nickel-coated microscope slide. The slide was probed with anti-GST (10). 
(C) An enlarged image of one of the 48 blocks is depicted to the right of 
the proteome chip. 

Fig. 2. (A) Examples of different assays on the
proteome chips. Proteome chips containing
6566 yeast proteins were spotted in duplicate
and incubated with the biotinylated probes in-
dicated. The positive signals in duplicate
(green) are in the bottom row of each panel;
the top row of each panel shows the same
yeast protein preparations of a control pro-
teome chip probed with anti-GST (red). The
upper panel shows the amounts of GST fusion
proteins as detected by the anti-GST (red). (B)
A putative calmodulin-binding motif (32) is
shown, which was identified by searching for
amino acid sequences that are shared by the
different calmodulin targets (10). Fourteen of
39 positive proteins share a motif whose con-
sensus is (I/L)QXXK(K/X)GB, where X is any
residue and B is a basic residue. The size of the
letter indicates the relative frequency of the
amino acid indicated.

To determine how much fusion protein
was covalently attached to different glass sur-
faces and the reproducibility of the protein
attachment, we probed the chips with anti-
bodies to GST (anti-GST). More than 93.5%
of the protein samples gave signals signifi-
cantly above background (i.e., greater than 10
fg of protein), and 90% of the spots contained
10 to 950 fg of protein. Our results also
demonstrate that it is feasible to spot, with
excellent resolution, 13,000 protein samples
in one-half the area of a standard microscope
slide (Fig. 1C). To test the reproducibility of
the protein spotting, we compared the signals
from each pair of duplicated spots with one
another; 95% of the signals were within 5%
of the average (10).
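The duplicate-spot comparison described above can be illustrated with a short sketch. It is not the authors' analysis code; the function name, the data layout (one pair of signals per protein), and the example values are assumptions made for illustration only.

```python
def fraction_within_tolerance(pairs, tolerance=0.05):
    """Fraction of duplicate-spot pairs whose signals lie within
    `tolerance` (expressed as a fraction) of the pair average.

    Both spots of a pair are equally far from their average, so it is
    enough to test one of them.
    """
    if not pairs:
        raise ValueError("need at least one pair of signals")
    within = 0
    for first, second in pairs:
        average = (first + second) / 2.0
        if average > 0 and abs(first - average) <= tolerance * average:
            within += 1
    return within / len(pairs)


# Hypothetical signals for three duplicated spots (arbitrary units).
example_pairs = [(100.0, 104.0), (100.0, 120.0), (50.0, 50.0)]
reproducible = fraction_within_tolerance(example_pairs)
```

With real chip data, a result near 0.95 would correspond to the 95% figure quoted in the text.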

The proteome chips were tested by
probing for several protein-protein interac-
tions and protein-lipid interactions. To test
for protein-protein interactions, the yeast
proteome was probed with biotinylated cal-
modulin in the presence of calcium (11).
Calmodulin is a highly conserved calcium-
binding protein, involved in many calcium-
regulated cellular processes, and has many
known partners (12). The bound biotiny-
lated protein was detected using Cy3-la-
beled streptavidin. As a control, we also
probed with Cy3-labeled streptavidin
alone. These studies identified six known
calmodulin targets (Fig. 2A): Cmk1p and
Cmk2p are the type I and II calcium/cal-
modulin-dependent serine/threonine pro-
tein kinases (12), Cmp2p is one of the two
yeast calcineurins (13), Dst1p plays a role
in transcription elongation (14), Myo4p is a
class V myosin heavy chain (15), and
Arc35p is a component of the Arp2/3 actin-
organizing complex (16). Arc35p was re-
cently shown to interact with calmodulin in
a two-hybrid study (17); thus, our data
confirm that Arc35p and calmodulin inter-
act in vitro. Of the six known calmodulin
targets that we did not detect, two are not in
our collection and the rest were not detect-
able in the GST probing experiments. In
addition to known partners, the calmodulin
probe identified 33 additional potential
partners. These include many different
types of proteins [supplementary table 1
(10)], consistent with a role for calmodulin
in many diverse cellular processes.

Sequence searching (J) revealed that 14 of
the 39 calmodulin-binding proteins contain a
motif whose consensus is (I/L)QXXK(K/
X)GB, where X is any residue and B is a
basic residue (Fig. 2B). A related sequence in
myosins, IQXXXXKXXXR, has been shown
previously to bind calmodulin (18). Thus, we
demonstrate that the domain is found in many
calmodulin-binding proteins. Presumably the
other targets that lack this motif have other
calmodulin-binding sequences (10).
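As a rough illustration of how a consensus such as (I/L)QXXK(K/X)GB could be searched for in protein sequences, the sketch below translates it into a regular expression. This is only one possible reading of the consensus, not the search method used in the study: the (K/X) position is treated as any residue, B is assumed to mean K, R, or H, and the function name and test sequence are hypothetical.

```python
import re

# (I/L) Q X X K (K/X) G B
# X is any residue, so the (K/X) position also matches any residue.
# B (basic residue) is assumed here to be K, R, or H.
CALMODULIN_MOTIF = re.compile(r"[IL]Q..K.G[KRH]")


def find_motif(sequence):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group())
            for m in CALMODULIN_MOTIF.finditer(sequence.upper())]


# Hypothetical sequence fragment, not taken from any yeast protein.
hits = find_motif("MSTLQAAKKGRTT")
```

A pattern this short will also match by chance in unrelated proteins, so hits from such a search would still need the kind of experimental support described in the text.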
In addition to the calmodulin-binding tar-
gets, we also identified one protein, Pyc1p,
that bound Cy3-labeled streptavidin. Pyc1p
encodes a pyruvate carboxylase 1 homolog
that contains a highly conserved biotin at-
tachment region (19). Thus, as predicted by
its sequence, Pyc1p is biotinylated in vivo.
With appropriate detection assays, we expect
that proteome chips can identify many types
of posttranslational modification of proteins.

To test whether proteome chips could be
used to identify activities that might not be
accessible by other approaches, such as protein-
drug interactions and protein-lipid interactions,
we screened for phosphoinositide (PI)-binding
proteins. PIs are important constituents of cel-
lular membranes and also serve as second mes-
sengers that regulate diverse cellular processes,
including growth, differentiation, cytoskeletal
rearrangements, and membrane trafficking
(20). Because they are often present only tran-
siently and in low abundance within cells, PIs
have not been characterized extensively, and
little is known about which proteins bind dif-
ferent phospholipids (20).

Five types of PI liposomes and one lipo-
some lacking PIs were used to probe the
proteome chips. Each contains phosphatidyl-
choline (PC) with 1% (w/w) N-(biotinoyl)-
1,2-dihexadecanoyl-sn-glycero-3-phospho-
ethanolamine, triethylammonium salt (biotin
DHPE); the biotinylated lipid serves as a
label that can be detected by Cy3-streptavidin
(21). In addition to PC, the five other lipo-
somes contain either 5% (w/w) PI(3)P,
PI(4)P, PI(3,4)P2, PI(4,5)P2, or PI(3,4,5)P3
(Fig. 2A). All of these phospholipids have
been found in yeast except PI(3,4,5)P3 (20).

The six liposomes identified a total of 150 
different protein targets that produced signals 
significantly higher than the background; an 
algorithm was devised to assist in the identi- 
fication of positive signals (22). Fifty-two 
(35%) of the lipid-binding proteins corre-
spond to uncharacterized proteins. Of the 98
known proteins, 45 proteins are membrane-
associated and either have, or are predicted to
have, membrane-spanning regions (23, 24).



Fig. 3. Analysis of the PI lipid-binding proteins. To determine the PI-binding specificity of 150
positive proteins, their binding signals were normalized against the corresponding binding signals
of PC. On the basis of the ratios (PI/PC), the proteins were grouped into four categories: (A) 30
strong and specific, (B) 43 strong and nonspecific, (C) 19 weak and specific, and (D) 58 weak and
nonspecific PI-binding proteins. The green color intensity represents the PI/PC signal ratio as shown
by the scale in the figure. The first column to the right of the PI/PC binding ratios indicates the
maximum binding signal intensity (open boxes) and its confidence interval (solid horizontal lines);
the numbers indicate the log of the values. Blue, yellow, light-yellow, and red boxes in columns to
the right of confidence interval column identify membrane-associated proteins, protein kinases,
other kinases, and uncharacterized ORFs, respectively.

This includes integral membrane proteins,
those with lipid modifications [e.g., the gly-
cosyl phosphatidylinositol (GPI) anchor pro-
teins Tos6p and Sps2p (23) and prenylated
proteins (Gpa2p and the mating pheromone
a-factor) (25)], as well as peripherally asso-
ciated proteins [e.g., Kcc4p and Myo4p (15,
26)]. Eight others are involved in lipid me-
tabolism (e.g., Bpl1p) or inositol ring phos-
phorylation (e.g., Kcs1p) or are predicted to
be involved in membrane or lipid function
(e.g., Ylr020cp has homology to triacylglyc-
erol lipase). Of the 52 uncharacterized pro-
teins, 13 (25%) are predicted to be associated
with membranes (24) and others contain ba-
sic stretches, as might be expected for elec-
trostatic interactions with negatively charged
lipids. Surprisingly, 19 of the lipid-binding
proteins are kinases, and 17 of these are
protein kinases.

The phospholipid-binding proteins were
sorted into whether they bound lipids strong-
ly or weakly, on the basis of the phospho-
lipid-binding signal relative to the amount of
GST (Fig. 3) (22). We found that more (72%)
of the strong lipid-binding proteins (Fig. 3, A
and B) were characterized relative to the
weakly binding proteins (54%) (Fig. 3, C and
D) and more strong lipid-binding proteins are
known or predicted to be membrane-associ-
ated, relative to the weaker binding proteins
(Fig. 3, "Membrane" column). Interestingly,
13 of 17 of the protein kinases bind very
strongly to the PIs. We further grouped the
proteins by whether they preferentially bound
one or more PIs over PC. One-hundred and
one proteins bound to PC as well as or nearly
as well as to the PIs (PI/PC < U) (Fig. 3, B
and D). However, 49 proteins bound to one or
more PIs preferentially (PI/PC > 1.3) (Fig. 3,
A and C). Analysis of the strong PI-binding
proteins revealed that many of them specifi-
cally bound particular PIs. For example,
Stp22p, which is required for vacuolar target-
ing of plasma membrane proteins such as
Ste2p and Can1p, preferentially binds PI(3)P
(27). Nine protein kinases specifically bind
PI(4)P and PI(3,4)P2 strongly and one binds
these lipids weakly. Atp1p, a subunit of the
F1-ATP synthase of the mitochondrial inner
membrane, preferentially binds PI(4,5)P2
(28). Sps2p, which is localized to the pros-
pore membrane (29), also interacts specifical-
ly with PI(3,4)P2. Preferential binding of
Myo4p to PI(3,4)P2 may be important for its
interaction at the cell cortex and/or its regu-
lation. No strong lipid-binding targets were
found that specifically bound PI(3,4,5)P3, al-
though some proteins bound both this lipid
and others (Fig. 3). These results demonstrate
that many membrane-associated proteins, in-
cluding integral membrane proteins and pe-
ripherally associated proteins, preferentially
bind specific phospholipids in vivo.
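The two-way grouping used for Fig. 3 (strong versus weak binding, specific versus nonspecific) can be sketched as follows. This is a simplified illustration rather than the algorithm of reference (22): the PI/PC threshold of 1.3 comes from the text, but the strong/weak cutoff, the function name, and the input layout are assumptions, and any protein without a PI/PC ratio above 1.3 is simply treated as nonspecific.

```python
def classify_protein(pi_signals, pc_signal, gst_signal,
                     specific_ratio=1.3, strong_cutoff=1.0):
    """Assign a Fig. 3-style category (A-D) to one protein.

    pi_signals : dict mapping a PI name to its binding signal
    pc_signal  : binding signal with the PC-only liposome (must be > 0)
    gst_signal : anti-GST signal, used as a proxy for the amount of protein
    strong_cutoff is a hypothetical value; the text does not state one.
    """
    if pc_signal <= 0 or gst_signal <= 0:
        raise ValueError("PC and GST signals must be positive")
    if not pi_signals:
        raise ValueError("need at least one PI binding signal")

    best_signal = max(pi_signals.values())
    specific = best_signal / pc_signal > specific_ratio
    strong = best_signal / gst_signal >= strong_cutoff

    if strong:
        return "A" if specific else "B"
    return "C" if specific else "D"


# Hypothetical signals in arbitrary units.
category = classify_protein({"PI(3)P": 300.0, "PI(4)P": 100.0},
                            pc_signal=100.0, gst_signal=100.0)
```

Normalizing to the GST signal separates proteins that bind lipid strongly from proteins that are merely abundant on the slide, which is the distinction the text draws.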

Several proteins involved in glucose me-
tabolism were identified as phospholipid-
binding proteins. This includes (i) three en-
zymes involved in sequential glycolytic steps
[phosphoglycerate mutase (Gpm3p), enolase
(Eno2p), and pyruvate kinase (Cdc19p/
Pyk1p)], (ii) hexokinase (Hxk1p), and (iii)
two protein kinases (Snf1p and Rim15p). Al-
though unexpected, previous studies indicate
that some of these might interact with lipids.
Hxk1p binds zwitterion micelles, which
stimulate its activity (30), and Eno2p is se-
creted, suggesting an interaction with mem-
branes (31). We speculate that either phos-
pholipids regulate steps involved in glucose
metabolism or many steps of glucose metab-
olism occur on membrane surfaces. In the
latter case, the phospholipids would serve as
a scaffold to efficiently carry out glycolytic
steps.

Six proteins not expected to be involved
in membrane function or lipid signaling,
Rim15p, Eno2p, Hxk1p, Sps1p, Ygl059wp,
and Gcn2p, were further tested for PI-binding
using two types of standard assays (30). For
three proteins, Rim15p, Eno2p, and Hxk1p,
PI(4,5)P2 liposomes were first adhered to a
nitrocellulose membrane; different amounts
of the GST fusion proteins and a GST control
were used to probe the membrane, and bound
proteins were detected using anti-GST. As
shown in Fig. 4A, each yeast fusion protein
tightly bound PI(4,5)P2, whereas GST alone
did not. We also carried out the reverse assay
for GST fusion proteins of Rim15p, Sps1p,
Ygl059wp, and Gcn2p (30). Different
amounts of these purified proteins were spot-
ted onto nitrocellulose and probed with the
six different liposomes (Fig. 4B); the bound
liposomes were detected using a horseradish
peroxidase (HRP)-conjugated streptavidin.
As with the microarrays, liposomes bound to
each protein, but not the bovine serum albu-
min control. Sps1p bound all five PI-contain-
ing liposomes nearly equally. Rim15p,
Gcn2p, and Ygl059wp exhibited different af-
finities to different liposomes (see Fig. 4B for
Rim15p); PI(3)P, PI(4)P, and PI(3,4)P2
bound strongest. In each case, a linear corre-
lation between the binding signal and the
level of protein was revealed (10). In sum-
mary, these results demonstrate that PI-bind-
ing proteins identified in the proteome array
also bind lipids in conventional assays.
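One simple way to check for the kind of linear relationship between binding signal and protein level mentioned above is a Pearson correlation over a dilution series. The sketch below is illustrative only; the text does not say which statistic was used, and the dilution amounts and signal values are invented.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical dilution series: amount of protein spotted and measured signal.
protein_amount = [10, 20, 40, 80]
binding_signal = [105, 198, 410, 795]
r = pearson_r(protein_amount, binding_signal)
```

A value of r close to 1 would be consistent with signal that scales linearly with the amount of protein spotted.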

One concern about our experiments is that
because proteins are purified from yeast, we
might detect indirect interactions through as-
sociated proteins. Most of the interactions
that we detect are expected to be directly or at
least tightly associated with the protein of
interest, because proteins were prepared us-
ing stringent conditions, and for the seven
samples examined, contaminating bands
were not detected using Coomassie staining.
Another limitation is that properly folded ex-
tracellular domains and secreted proteins are
likely to be underrepresented in our collec-
tion, because GST and a HisX6 tag are fused
at the NH2-terminus. Thus, proteins with a
signal peptide may not be delivered to the
secretory pathway and may not be folded or
modified properly, although we did detect
three signal peptide-containing proteins, sug-
gesting that at least some are produced and
contain functional domains. Another limita-
tion is that not all interactions are detected
because not all proteins are readily overpro-
duced and purified in this high-throughput
approach; we expect that 80% of the arrayed
yeast proteins are full length and at reason-
able levels for screening.

Fig. 4. Conventional methods confirm
protein-lipid interactions detected by
the proteome microarrays (30). (A)
PI(4,5)P2 liposomes were first adhered
to a nitrocellulose membrane; a dilution
series of Rim15p, Eno2p, and Hxk1p,
and a GST control were used to probe
the membrane. The bound proteins
were detected using anti-GST and an
ECL kit. (B) A reverse assay was carried
out to test potential protein-lipid inter-
actions. The proteins were prepared and
spotted onto nitrocellulose filters in a dilution series and probed with the six different liposomes.
As a control, the six liposomes were also added to the membrane. After extensive washing, the
bound liposomes were detected using an HRP-conjugated streptavidin and an ECL kit.

Regardless, the use of proteome chips has
significant advantages over existing ap-
proaches. Random expression libraries are
incomplete, the clones are often not full
length, and the libraries are tedious to screen.
A recent alternative approach is to generate
defined arrays and screen them using a pool-
ing strategy (4). The pooling strategy re-
quires two steps, the actual number of pro-
teins screened is not known, and the method
does not work well when large numbers of
reacting proteins exist, because each pool
will test positive. Another method for detect-
ing interactions is the two-hybrid approach
(2), in which interactions are typically detect-
ed in the nucleus, thus limiting the types of
interactions that can be detected. The advan-
tage of the proteome chip approach is that a
comprehensive set of individual proteins can
be directly screened in vitro for a wide vari-
ety of activities, including protein-drug inter-
actions, protein-lipid interactions, and enzy-
matic assays using a wide range of in vitro
conditions. Furthermore, once the proteins
are prepared, proteome screening is signifi-
cantly faster and cheaper. Using similar pro-
cedures, it is clearly possible to prepare pro-
tein arrays of 10 to 100,000 proteins for
global proteome analysis in humans and other
eukaryotes.

An fMRI Investigation of
Emotional Engagement in Moral
Judgment

Joshua D. Greene,1,2* R. Brian Sommerville,1 Leigh E. Nystrom,1,3
John M. Darley,3 Jonathan D. Cohen1,3,4

The long-standing rationalist tradition in moral psychology emphasizes the role 
of reason in moral judgment. A more recent trend places increased emphasis 
on emotion. Although both reason and emotion are likely to play important 
roles in moral judgment, relatively little is known about their neural correlates,
the nature of their interaction, and the factors that modulate their respective
behavioral influences in the context of moral judgment. In two functional
magnetic resonance imaging (fMRI) studies using moral dilemmas as probes, we
apply the methods of cognitive neuroscience to the study of moral judgment.
We argue that moral dilemmas vary systematically in the extent to which they 
engage emotional processing and that these variations in emotional engage- 
ment influence moral judgment. These results may shed light on some puzzling 
patterns in moral judgment observed by contemporary philosophers. 



The present study was inspired by a family of
ethical dilemmas familiar to contemporary
moral philosophers (1). One such dilemma is
the trolley dilemma: A runaway trolley is
headed for five people who will be killed if it
proceeds on its present course. The only way
to save them is to hit a switch that will turn
the trolley onto an alternate set of tracks
where it will kill one person instead of five.
Ought you to turn the trolley in order to save
five people at the expense of one? Most
people say yes. Now consider a similar prob-
lem, the footbridge dilemma. As before, a
trolley threatens to kill five people. You are
standing next to a large stranger on a foot-
bridge that spans the tracks, in between the
oncoming trolley and the five people. In this
scenario, the only way to save the five people
is to push this stranger off the bridge, onto the
tracks below. He will die if you do this, but
his body will stop the trolley from reaching
the others. Ought you to save the five others
by pushing this stranger to his death? Most
people say no.

Taken together, these two dilemmas cre-
ate a puzzle for moral philosophers: What
makes it morally acceptable to sacrifice one
life to save five in the trolley dilemma but not
in the footbridge dilemma? Many answers
have been proposed. For example, one might
suggest, in a Kantian vein, that the difference
between these two cases lies in the fact that in
the footbridge dilemma one literally uses a
fellow human being as a means to some
independent end, whereas in the trolley di-
lemma the unfortunate person just happens to

Summary

The hypothalamus plays a central role in the integrated
control of feeding and energy homeostasis. We have
identified two novel neuropeptides, both derived from
the same precursor by proteolytic processing, that
bind and activate two closely related (previously) or-
phan G protein-coupled receptors. These peptides,
termed orexin-A and -B, have no significant structural
similarities to known families of regulatory peptides.
Prepro-orexin mRNA and immunoreactive orexin-A
are localized in neurons within and around the lateral
and posterior hypothalamus in the adult rat brain.
When administered centrally to rats, these peptides
stimulate food consumption. Prepro-orexin mRNA
level is up-regulated upon fasting, suggesting a physi-
ological role for the peptides as mediators in the cen-
tral feedback mechanism that regulates feeding be-
havior.



Introduction 

Members of the family of seven-transmembrane, G pro- 
tein-coupled cell surface receptors (GPCRs) respond to 
a wide variety of signals, including photons, amines, 

7 To whom correspondence should be addressed. 

8 Present address: Institute of Basic Medical Sciences, University of 

Tsukuba, Ibaraki 3050006, Japan. 

"Present address: First Department of Surgery, Osaka University 
Medical School, Osaka 5650871, Japan. 

"Present address: Institute of Community Medicine, University of 
Tsukuba, Ibaraki 3050006, Japan. 



lipids, peptides, and proteases. Ligand binding initiates 
intracellular signal transduction through the activation 
of heterotrimeric G proteins. All of the known small regu- 
latory peptides (small peptide hormones and neuropep- 
tides) exert their biological actions by acting on GPCRs. 
Recent efforts in genomics research have identified a 
large number of cDNA sequences that encode "orphan" 
GPCRs, i.e., putative GPCRs without known cognate 
ligands. Many of these orphan GPCRs are likely to be 
receptors for heretofore unidentified signaling mole- 
cules, including new peptide hormones and neuropep- 
tides. GPCRs already represent the largest class of tar-
get molecules for drugs available in the clinic (Hardman
et al., 1996). The orphan GPCRs therefore represent a
fruitful resource for drug discovery (Stadel et al., 1997).

To approach these possibilities, we undertook a sys- 
tematic biochemical search for endogenous peptide li- 
gands for multiple orphan GPCRs, using a cell-based 
reporter system. These screening experiments led to 
the identification of a novel family of neuropeptides that 
bind to two closely related orphan GPCRs. We call these 
peptide ligands "orexins," after the Greek word orexis, 
which means appetite. The mRNA for the precursor of 
these peptides is abundantly and specifically expressed 
in the lateral hypothalamus and adjacent areas, a region 
classically implicated in the central regulation of feeding 
behavior and energy homeostasis (Oomura, 1980; Ber- 
nardis and Bellinger, 1993, 1996). These peptides stimu- 
late food consumption when administered centrally, and 
their production is influenced by the nutritional state of the
animal. Here we report the identification and initial bio- 
logical characterization of two orexins as well as their 
two receptors. 

Results and Discussion 

Purification and Structure of Orexins 

Our general strategy was to screen high-resolution HPLC 
fractions of various tissue extracts for GPCR-agonist 
activity, using multiple orphan GPCR-expressing cell 
lines tested in parallel. For this purpose, we generated 
over 50 stable transfectant cell lines, each expressing 
a distinct orphan GPCR cDNA, which can be used as 
reporter cell lines in order to identify endogenous ligands 
for these receptors. We challenged the cells with HPLC 
fractions derived from tissue extracts and we monitored 
a number of signal transduction read-outs for hetero- 
trimeric G protein activation. A potential problem in this 
approach is the presence of numerous endogenous 
GPCRs in the host cell line. To circumvent this problem 
in part, we assayed active fractions against at least 2-3 
independent transfectant cell lines each expressing a 
different orphan GPCR. This enabled us to search for 
active fractions that activate only cells that express a 
specific receptor. 

Figure 1. Purification of Endogenous Ligands for the Orphan Receptor HFGAN72 (OX1R)

(A) Activation of the HFGAN72 receptor in transfected HEK293 cells by reverse-phase HPLC fractions of crude peptide extracts from rat brain.
Top panel, chromatogram. Acetonitrile gradient and absorbance of eluted materials are indicated by solid and dotted lines, respectively.
Middle and bottom panels, [Ca2+]i transients in two HEK293 cell lines stably expressing unrelated orphan receptors HFGAN72 and HPRAJ70,
respectively. Peak increments in [Ca2+]i evoked by designated HPLC fractions are normalized against the response induced by 100 nM
endothelin-1 in the same reporter cell preparations and plotted as relative activity. Note that a number of fractions induced similar responses
in both cell lines, presumably by acting on endogenous receptors in the host HEK293 cells. However, fractions 60-61 (marked as A; containing
orexin-A) induced robust [Ca2+]i transients in HFGAN72-transfected cells but not in HPRAJ70-transfected cells. Fractions 56 and 58 also
exhibited weaker but distinct activities specific to HFGAN72 (marked as B and B'; containing orexin-B and orexin-B[3-28], respectively).

(B) Cation exchange HPLC of fractions 60-61 from Figure 1A. Bars show normalized magnitude of [Ca2+]i in HFGAN72-expressing HEK293
cells as in Figure 1A.

(C) Reverse-phase HPLC of active fractions from Figure 1B.

(D) Final purification by reverse-phase HPLC at 40°C of active fractions in Figure 1C. Arrow, final orexin-A peak.

In this process, we discovered several reverse-phase
HPLC fractions of rat brain extracts (Figure 1) that elicit
a robust increase of cytoplasmic Ca2+ levels ([Ca2+]i)
in HEK293 cells expressing an orphan GPCR originally
termed HFGAN72. This receptor was initially identified
as an expressed sequence tag (EST) from human brain
(see Figure 2C). A major activity peak (marked "A" in
Figure 1A) was reproducibly accompanied by two addi-
tional minor peaks of activity (B and B' in Figure 1A).
These activities appeared to be specific for the HFGAN72
receptor, since the active fractions failed to evoke de-
tectable [Ca2+]i transients in cells expressing an unre-
lated orphan receptor, HPRAJ70, in a parallel assay (Fig-
ure 1A). These activities seemed to be peptidic, since
protease treatment of the active fractions destroyed the
activity.
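The selection logic described here (normalize each fraction's response to the endothelin-1 response, then keep fractions that activate the target cell line but not the unrelated control line) can be sketched as below. This is an illustration under stated assumptions, not the authors' procedure: the function name, the two thresholds, and all response values are hypothetical.

```python
def receptor_specific_fractions(target, control, et1_target, et1_control,
                                min_activity=0.3, max_control=0.1):
    """Return HPLC fraction numbers that look receptor-specific.

    target, control : dicts mapping fraction number to the peak [Ca2+]i
                      increment in the target and control cell lines
    et1_target, et1_control : response to endothelin-1 in each cell line,
                      used to express every response as relative activity
    min_activity, max_control : hypothetical cutoffs on relative activity
    """
    if et1_target <= 0 or et1_control <= 0:
        raise ValueError("endothelin-1 responses must be positive")
    hits = []
    for fraction in sorted(target):
        relative_target = target[fraction] / et1_target
        relative_control = control.get(fraction, 0.0) / et1_control
        if relative_target >= min_activity and relative_control <= max_control:
            hits.append(fraction)
    return hits


# Made-up responses: fraction 45 acts on both cell lines (as an endogenous
# HEK293 receptor ligand would), whereas 56, 58, and 60 act only on the target.
target_line = {45: 80.0, 56: 40.0, 58: 35.0, 60: 90.0}
control_line = {45: 85.0, 56: 2.0, 58: 3.0, 60: 5.0}
specific = receptor_specific_fractions(target_line, control_line, 100.0, 100.0)
```

Requiring a low response in the control line is what filters out fractions acting on receptors endogenous to the host cells.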

We purified the major activity to homogeneity from
acid-acetone extracts of rat brains by collecting active
fractions after four steps of HPLC (Figure 1). The final
purified material, corresponding to the absorbance peak
at the arrow in Figure 1D, was subjected to structural
analyses by limited protease digestions followed by
nanoelectrospray tandem mass spectrometry, Edman
degradation, and MALDI mass spectrometry (see Exper-
imental Procedures for details). The active substance,
termed orexin-A, was found to be a 33-amino acid pep-
tide of 3,562 Da, with an N-terminal pyroglutamyl residue
and C-terminal amidation (Figure 2A). Molecular mass of
the peptide as well as its sequencing analyses indicated
that the four Cys residues of orexin-A formed two sets of
intrachain disulfide bonds. The topology of the disulfide
bonds was determined by selectively synthesizing the
peptides with different disulfide topologies by using se-
lective deprotection and sequential oxidation of two of
the four Cys residues and then comparing these syn-
thetic peptides with purified natural orexin-A in isocratic
reverse-phase HPLC. The topology was determined to
be [Cys6-Cys12, Cys7-Cys14] (Figure 2A). Using similar
protocols, we also purified the analogous activity from
bovine hypothalamic extracts, and we found that the
structure of bovine orexin-A was identical to that of the
rat peptide.

Figure 2. Amino Acid Sequences of Orexins and Orexin Receptors

(A) Structures of mature orexin-A and -B peptides. The topology of the two intrachain disulfide bonds in orexin-A is indicated above the
sequence. The N-terminal pyroglutamyl residue (<E) in orexin-A is depicted. Amino acid identities are indicated by boxes. Asterisks indicate
that human and mouse sequences are deduced from the respective genomic sequences and not from purified peptides.

(B) Deduced amino acid sequences of prepro-orexin precursor polypeptides. Orexin-A and -B sequences are boxed. Secretory signal sequences
predicted by the SignalP Server (http://www.cbs.dtu.dk/services/SignalP/) are marked by equal signs (=); prohormone convertase cleavage
and amidation sites by asterisks (*). Interspecifically identical residues are indicated by vertical bars. The full-length nucleotide sequences of
human, rat, and mouse prepro-orexin cDNAs have been submitted to GenBank (accession numbers AF041240, AF041241, and AF041242).

(C) Deduced amino acid sequences of rat (r) and human (h) OX1 and OX2 receptors. Amino acid residues that are identical in all four sequences
are boxed. Putative transmembrane (TM) domains are marked above the aligned sequences, as predicted by the PredictProtein server
(http://www.embl-heidelberg.de/predictprotein). Gaps introduced to obtain optimal alignment are indicated by dashes. The full-length nucleotide
sequences of human and rat orexin receptor cDNAs have been submitted to GenBank (accession numbers AF041243, AF041244, AF041245,
and AF041246).

In addition to the major peak of activity, the HPLC 
fractions contained two minor activity peaks that we 
designated B and B' (Figure 1A). We isolated peptides 
responsible for both of these activities from rat brain 
extracts using similar purification protocols (see Experi- 
mental Procedures). Mass spectrometry and Edman se- 
quencing showed that peak B contained a 28-amino 
acid, C-terminally amidated linear peptide of 2,937 Da. 



The peptide, termed orexin-B, was 46% (13/28) identical
in sequence to orexin-A (Figure 2A). Peak B' contained
an N-terminally truncated [des Arg-Pro]orexin-B, which
we designate orexin-B[3-28]. Further investigation is re-
quired to determine whether this truncation represents
physiologically relevant processing or whether it is a
result of artifactual proteolysis during the extraction
procedure. In the subsequent studies described below,
only orexin-A and full-length orexin-B were subjected
to further biological characterization.
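The identity figure quoted above, 46% (13/28), is the number of identical aligned positions divided by the length of orexin-B. A minimal sketch of that calculation follows; the function name is hypothetical, and the toy strings stand in for an alignment because the peptide sequences themselves are shown only in Figure 2A.

```python
def percent_identity(aligned_a, aligned_b, denominator=None):
    """Count identical positions in two aligned sequences.

    '-' marks a gap and never counts as a match. By default the
    percentage is taken over the alignment length; pass `denominator`
    to use another length, such as that of the shorter peptide.
    Returns (number of matches, percent identity).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b)
                  if a == b and a != "-")
    if denominator is None:
        denominator = len(aligned_a)
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    return matches, 100.0 * matches / denominator


# Toy alignment for illustration only, not the orexin sequences.
toy_matches, toy_percent = percent_identity("ACDE-", "ACDFG")

# The arithmetic behind the figure in the text: 13 matches over 28 residues.
orexin_percent = round(100.0 * 13 / 28)
```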

Structure of the Prepro-Orexin Precursor 

The amino acid sequences of purified orexin-A and -B 
exhibited no meaningful similarity to any known pep- 
tides. A BLAST search of the GenBank database also 
failed to detect an entry that contains these or similar 
sequences. Therefore, we sought to isolate the cDNA 
encoding the precursor polypeptide; we first obtained 
a cDNA fragment encoding a part of orexin-A by RT-PCR 
of rat brain mRNA, using highly degenerate primers
based on parts of the orexin-A sequence. We then per-
formed 5'-RACE and 3'-RACE reactions to obtain the
full-length cDNA. We found that this cDNA encoded
both orexin-A and -B (Figures 2A and 2B).

Figure 3. Pharmacological Characterization
of Synthetic Human Orexins on Human Or-
exin Receptors Expressed in Stably Trans-
fected CHO Cells

(A and B) Competitive radioligand binding
assays. Displacement of [125I-Tyr17]orexin-A
binding to cells expressing human OX1R (A)
and OX2R (B) by increasing concentrations of
unlabeled orexin-A (open circle) and human
orexin-B (filled circle) are determined in qua-
druplicates. Level of nonspecific binding was
approximately 20% of the binding in the ab-
sence of competitor.

(C and D) Dose-response relationships of
[Ca2+]i transients evoked by orexin-A (open
circle) and human orexin-B (filled circle) in
cells expressing human OX1R (C) and OX2R
(D). Peak [Ca2+]i values are determined in du-
plicates.

The 5'-mostATG codonofthe 585 bpcDNA(GenBank 
accession number AF041241) was preceded by an in- 
frame stop codon, and the sequence around this initia- 
tion codon conformed to Kozak's rules. The open read- 
ing frame starting with this ATG encodes a 1 30-residue 
polypeptide, the rat prepro-orexin (Figure 2B). The first 
33 amino acids exhibited characteristics of a secretory 
signal sequence: a hydrophobic core followed by resi- 
dues with small polar side chains (von Heijne, 1986). 
The SignalP Server web site predicted that Ala32-Gln33 
was the most likely site for signal sequence cleavage. 
The orexin-A sequence starts with Gln33, which is pre- 
sumably cyclized enzymatically into the N-terminal py- 
roglutamyl residue by transamidation (Busby et al., 1987; 
Bateman et al., 1990). Thus, the mature peptide directly 
follows the signal peptide cleavage site. The last residue 
of the mature peptide is followed by Gly66, which pre- 
sumably serves as NH2 donor for C-terminal amidation 
by sequential actions of peptidylglycine monooxygenase 
and peptidylamidoglycolate lyase (Bradbury and Smyth, 
1991; Eipper et al., 1993). As expected, Gly66 is followed 
by a pair of basic amino acid residues, Lys67-Arg68, 
which constitute a recognition site for prohormone con- 
vertases (Rouille et al., 1995). The next segment of the 
deduced prepro-orexin sequence, Arg69-Met96, was 
identical to the sequence of purified orexin-B. The Met96 



residue is again followed by Gly-Arg-Arg, a C-terminal 
amidation signal, consistent with our finding that orexin-B 
is also C-terminally amidated. 
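
The processing signals described above (a Gly residue followed by a pair of basic residues, such as Gly66-Lys67-Arg68 after orexin-A and Gly-Arg-Arg after Met96) can be located in a precursor sequence with a simple pattern search. The following is a minimal sketch only: the function name and the toy sequence are hypothetical and are not the prepro-orexin sequence, and a real analysis would also have to consider convertase specificity and sequence context.

```python
import re

# Gly followed by two basic residues (Lys or Arg). The Gly can donate the
# amide nitrogen, and the dibasic pair is a prohormone convertase site.
AMIDATION_SIGNAL = re.compile(r"G[KR][KR]")

def find_amidation_signals(precursor: str) -> list[tuple[int, str]]:
    """Return (1-based position of the Gly, matched motif) for each hit."""
    return [(m.start() + 1, m.group()) for m in AMIDATION_SIGNAL.finditer(precursor)]

# Hypothetical toy sequence for illustration; NOT the real prepro-orexin.
toy = "MAAAQPLPDCGKRRSGPPGLQGRRLSA"
print(find_amidation_signals(toy))  # [(11, 'GKR'), (22, 'GRR')]
```

A residue immediately preceding each reported Gly would be the candidate C-terminal amidated residue of a mature peptide.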

We isolated human and mouse genomic fragments 
containing the prepro-orexin gene in order to predict 
the sequences of the human and mouse prepro-orexin 
polypeptides. Details of the gene structure will be re- 
ported elsewhere. The predicted human and mouse or- 
exin-A sequences were both identical to rat and bovine 
orexin-A (Figure 2A). Mouse orexin-B was also predicted 
to be identical to rat orexin-B. Human orexin-B had 
two amino acid substitutions when compared with the 
rodent sequence (Figure 2A). Overall, the human and 
mouse prepro-orexin sequences were 83% and 95% 
identical to their rat counterparts, respectively. A ma- 
jority of amino acid substitutions were found in the 
C-terminal part of the precursor, which appears unlikely 
to encode for another bioactive peptide. 

Radiation hybrid mapping showed that the human 
prepro-orexin gene is most tightly linked to the MIT STS 
markers WI-6595 and UTR9641. The inferred cytoge- 
netic location between these markers is 17q21. Interest- 
ingly, the localization at chromosome 17q21 raises the 
possibility that the prepro-orexin gene may be a candi- 
date gene for a group of neurodegenerative disorders 
collectively called "chromosome 17-linked dementia" 
(Wilhelmsen, 1997), including nosological entities such 
as disinhibition-dementia-parkinsonism-amyotrophy com- 
plex (DDPAC; MIM No. *600274) and pallido-ponto- 
nigral degeneration (PPND; MIM No. *168610), which 
may be caused by allelic mutations. Both DDPAC and 
PPND have recently been mapped to 17q21-22 (Wil- 
helmsen et al., 1994; Wijker et al., 1996). 

Characterization of Orexin Receptors 

Figure 2C shows the deduced amino acid sequences 
of the original HFGAN72 receptor, which we now call 
OX1 receptor (OX1R). Among various classes of GPCR, 
OX1R is structurally most similar to certain neuropeptide 
receptors, most notably to the Y2 neuropeptide Y (NPY) 
receptor (26% identity), followed by the TRH receptor, 
cholecystokinin type-A receptor, and NK2 neurokinin 
receptor (25%, 23%, and 20% identity, respectively). 
This is consistent with our hypothesis that OX1R is the 
receptor for orexins, another class of small regulatory 
peptides. In order to characterize further their pharma- 
cological interactions, we performed in vitro functional 
assays using transfected cell lines expressing the recep- 
tor. Mock transfected CHO cells did not exhibit detect- 
able levels of specific binding of radio-iodinated [125I- 
Tyr17]orexin-A. Stable transfection of CHO cells with an 
expression vector containing the human OX1R cDNA 
(CHO/OX1R) conferred the ability to bind [125I]orexin-A 
(Figure 3A). The radioligand binding was inhibited by nano- 
molar concentrations of unlabeled synthetic orexin-A in 
a dose-dependent manner, but not by any of several 
unrelated peptides tested, including human NPY and 
endothelin-1 at up to 10 µM (data not shown). The con- 
centration of unlabeled orexin-A required to displace 
50% of specific radioligand binding (IC50) was 20 nM as 
calculated by the LIGAND program (Munson and Rod- 
bard, 1980). Orexin-A also induced a transient increase 
in [Ca2+]i in CHO/OX1R cells in a dose-dependent man- 
ner (Figure 3C), but failed to induce detectable [Ca2+]i 
transients in mock transfected CHO cells. We feel that 
the calcium mobilization is likely caused by the activa- 
tion of the Gq class of heterotrimeric G proteins (Hepler 
et al., 1994). The calculated concentration of orexin-A 
required to induce half-maximum response (EC50) was 
30 nM. We obtained similar results in radioligand bind- 
ing and [Ca2+]i transient assays performed with stably 
transfected HEK293 cells (data not shown). These find- 
ings confirm that orexin-A is indeed a specific, high- 
affinity agonist for OX1R. 
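
To illustrate what an IC50 of 20 nM means for the displacement curve, the sketch below evaluates a simple one-site competition model with a Hill slope of 1. This model is an assumption made for illustration; it is not the fitting procedure of the LIGAND program, and the function name and the optional nonspecific-binding parameter are hypothetical.

```python
def fraction_bound(competitor_nM: float, ic50_nM: float, nonspecific: float = 0.0) -> float:
    """Fraction of total radioligand binding remaining at a competitor dose.

    Assumes one-site competition with unit Hill slope. `nonspecific` is the
    fraction of total binding that the competitor cannot displace.
    """
    specific = 1.0 / (1.0 + competitor_nM / ic50_nM)
    return nonspecific + (1.0 - nonspecific) * specific

# At a competitor concentration equal to the IC50, half of the specific
# binding remains.
for dose in (2.0, 20.0, 200.0):
    print(dose, round(fraction_bound(dose, ic50_nM=20.0), 3))
```

With a nonspecific fraction of 0.2, the same model gives a total bound fraction of 0.6 at the IC50, because only the specific component is displaced.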

As expected from our purification experiments, syn- 
thetic human orexin-B also acted as a specific agonist 
on CHO/OX1R cells in a parallel set of experiments (Fig- 
ures 3A and 3C). Interestingly, however, we found that 
human orexin-B has significantly lower affinity for the 
human OX1R: the calculated IC50 in the competitive bind- 
ing assay and the EC50 in the [Ca2+]i transient assay were 
420 nM and 2500 nM for human orexin-B, respectively, 
indicating roughly 1-2 orders of magnitude lower affinities as 
compared with orexin-A. Similar results were obtained 
with HEK293 cells expressing OX1R. 

The findings with orexin-B led us to suspect that there 
is an additional orexin receptor(s) for which orexin-B has 
high affinity. A BLAST (tblastn) search of the GenBank 
dbEST database with the OX1R amino acid sequence 
detected two highly similar EST entries: accession num- 
bers D81887 (human fetal brain cDNA) and W86548 (hu- 
man fetal liver/spleen cDNA). These EST sequences 

Figure 4. Tissue Distribution of Rat prepro-orexin and Orexin Re- 
ceptor mRNAs 

A membrane containing ~2 µg/lane of poly(A)+ RNA from indicated 
rat tissues (Clontech, cat. no. 7764-1) was sequentially rehybridized 
with designated cDNA probes using standard methods. Sizes of 
prepro-orexin, OX1R, and OX2R mRNAs are approximately 0.7, 2.5, 
and 3.5 kb, respectively (arrowheads). A segment of rat prepro- 
orexin cDNA encoding Gln33-Ser128 (Figure 2B) as well as full- 
length rat OX1R and OX2R cDNAs (Figure 2C) were used as probes. 
These probes do not detectably cross-hybridize with each other 
under the high-stringency conditions used. Autoradiography was 
exposed for 5 days, 3 days, 3 days, and 3 hr for prepro-orexin, OX1R, 
OX2R, and β-actin probes, respectively. 



showed much higher similarities to the OX1R sequence 
than to numerous other GPCR entries. At the nucleotide 
sequence level, they differed enough from the human 
OX1R cDNA sequence to indicate that they are not de- 
rived from the OX1R gene itself. We subsequently found 
that these two ESTs were derived from 5' and 3' parts of 
the same transcript. Full-length cloning and sequencing 
showed that this cDNA encodes a GPCR, termed OX2R, 
that highly resembles OX1R (Figure 2C). The amino acid 
identity between the deduced full-length human OX1R 
and OX2R sequences is 64%. Thus, these receptors are 
much more similar to each other than they are to other 
GPCRs. Amino acid identities between the human and 
rat homologs of each of these receptors are 94% for 
OX1R and 95% for OX2R, indicating that both receptor 
genes are highly conserved between the species. 

To characterize functionally the OX2R further, we re- 
peated competitive radioligand binding assays and 
[Ca2+]i transient dose-response studies using stably 
transfected CHO cells expressing the human OX2R 
cDNA. The results demonstrated that OX2R is indeed a 
high-affinity receptor for human orexin-B, with an IC50 
of 36 nM in the binding assay and an EC50 of 60 nM in 
the [Ca2+]i transient assay (Figures 3B and 3D). Orexin-A 
also had high affinity for this receptor, with 38 nM IC50 
and 34 nM EC50 values, which are similar to the values for 
orexin-B. Thus, we conclude that OX2R is a nonselective 

Figure 5. Localization of prepro-orexin mRNA and Immunoreactive Peptide in Adult Rat Brain 

(A) Visualization of neurons containing prepro-orexin mRNA in the adult rat hypo- and subthalamic areas by in situ hybridization. Coronal 
section of brain hybridized with 35S-labeled anti-sense riboprobe showing the bilateral and symmetrical distribution of labeled neurons. No 
detectable signal beyond background was generated by sense riboprobe. The yellow rectangle designates the area schematized in Figure 5B. 

(B) prepro-orexin mRNA-containing neurons are shown in red superimposed upon anatomical structures of the hypo- and subthalamic areas 
(Paxinos and Watson, 1986): lateral hypothalamic area (LH), perifornical nucleus (PeF), posterior hypothalamic area (PH), subthalamic nucleus 
(Sth), subincertal nucleus (Subl), and ventral zona incerta (ZIV). Additional landmarks include: thalamus (THAL), habenular complex (HAB), 
internal capsule (ic), optic tract (opt), mammillothalamic tract (mt), fornix (f), medial forebrain bundle (mfb), third ventricle (3V), arcuate 
hypothalamic nucleus (Arc), dorsomedial hypothalamic nucleus (DM), and ventromedial hypothalamic nucleus (VMH). (Permission for use 
granted by Academic Press.) 

receptor for both orexin-A and -B, while OX1R is selec- 
tive for orexin-A. 
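
The receptor selectivity described above can be summarized by taking, for each receptor, the ratio of the orexin-B value to the orexin-A value. The sketch below uses only the IC50 and EC50 values reported in the text; the dictionary layout and function name are hypothetical conveniences.

```python
# Reported values in nM: (IC50 from binding assay, EC50 from [Ca2+]i assay).
REPORTED = {
    "OX1R": {"orexin-A": (20.0, 30.0), "orexin-B": (420.0, 2500.0)},
    "OX2R": {"orexin-A": (38.0, 34.0), "orexin-B": (36.0, 60.0)},
}

def b_over_a(receptor: str) -> tuple[float, float]:
    """Return (IC50 ratio, EC50 ratio) of orexin-B relative to orexin-A."""
    a_ic50, a_ec50 = REPORTED[receptor]["orexin-A"]
    b_ic50, b_ec50 = REPORTED[receptor]["orexin-B"]
    return b_ic50 / a_ic50, b_ec50 / a_ec50

for receptor in REPORTED:
    ic50_ratio, ec50_ratio = b_over_a(receptor)
    print(f"{receptor}: IC50 ratio {ic50_ratio:.1f}, EC50 ratio {ec50_ratio:.1f}")
```

On OX1R the ratios come out at about 21 (binding) and 83 (calcium response), whereas on OX2R both ratios are within about 2-fold of 1, which is the numerical basis for calling OX1R selective for orexin-A and OX2R nonselective.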

In radiation hybrid mapping, the MIT markers showing 
tightest linkage to the human OX1R and OX2R genes are 
the STS markers D1S195 and D1S443, and WI-5448 and 
CHLC.GATA74F07, respectively. The inferred cytoge- 
netic locations between these markers are 1p33 for 
OX1R, and 6cen (p11-q11) for OX2R (accurate cytoge- 
netic locations are often difficult to interpret from radia- 
tion hybrid maps in which the gene lies near the cen- 
tromere). 

Tissue Distribution of Orexin and Orexin Receptors 

N-terminal pyroglutamyl cyclization (seen in orexin-A) 
as well as C-terminal amidation (both orexin-A and -B) 
are posttranslational modifications that are most often 
found in neuropeptides. Northern blot analysis of adult 
rat tissues showed that the 0.7 kb prepro-orexin mRNA 
is detected exclusively in the brain except, in a small 
amount, in the testis (Figure 4). In adult rats, OX1R and 
OX2R mRNAs were also detected exclusively in the brain 
(Figure 4). These findings are consistent with the idea 
that orexins are regulatory peptides that function within 
the central nervous system. 

Preliminary Northern blot analysis of human mRNA 
from multiple brain regions (human brain blots II and III, 
Clontech) demonstrated that the prepro-orexin mRNA 
was expressed abundantly in subthalamic nucleus, but 
undetectable in other brain regions tested (data not 
shown; hypothalamus was not included in these brain 
blots). prepro-orexin mRNA was also undetectable in 
the human heart, placenta, lung, liver, skeletal muscle, 
kidney, and pancreas (data not shown). These findings 
are consistent with the results of the in situ hybridization 
in the rat brain described below. 

Distribution of Orexin-Containing 
Neurons in Rat Brain 

To further localize orexin expression within the central 
nervous system, we performed in situ hybridization and 
immunohistochemical analyses in rat brains. In situ hy- 
bridization with a rat prepro-orexin cRNA probe in coro- 
nal sections showed orexin-containing neurons orga- 
nized bilaterally and symmetrically in a discrete set in 
hypothalamic and subthalamic areas of the adult rat 



brain (Figures 5A-5D). In the hypothalamus, positive 
neurons were found in the lateral (Figures 5A-5H) and 
posterior hypothalamic areas (Figures 5A-5F) and the 
perifornical nucleus (Figures 5C-5F). In the subthalamus 
(also called ventral thalamus), the zona incerta, subin- 
certal, and subthalamic nuclei were positive (Figure 5B). 
Both in situ hybridization histochemistry (Figures 5A and 
5C-5F) and immunofluorescent cytochemistry labeled 
the cytoplasm of medium-sized neurons (Figures 5D- 
5H), showing identical distributions. The shapes of the 
neurons varied from thin and fusiform to robust and 
multipolar. It is noteworthy that no signal was detected 
in neurons of the paraventricular (not shown), ventrome- 
dial (Figure 5B), or arcuate nuclei (Figure 5B), which are 
known to contain a variety of neuropeptides associated 
with food consumption (Bernardis and Bellinger, 1996; 
Bing et al., 1996; see below). 

Stimulation of Feeding Behavior by Centrally 
Administered Orexins 

The lateral hypothalamic area has been implicated in the 
regulation of feeding behavior and energy homeostasis 
ever since the classic experiments showing that animals 
with lateral hypothalamic lesions had decreased food 
intake and lower "set point" for body weight (Oomura, 
1980; Bernardis and Bellinger, 1993, 1996). The striking 
localization of orexin-containing neurons in the lateral 
hypothalamus and some of its adjacent areas suggested 
to us the possibility that the neuropeptide may be in- 
volved in the regulation of food intake. To evaluate this 
hypothesis, we administered orexin acutely into the lat- 
eral ventricle of male rats through preimplanted indwell- 
ing catheters. When a bolus was administered intracere- 
broventricularly in early light phase, orexin-A stimulated 
food consumption in a dose-dependent manner within 
1 hr (Figure 6, left). The magnitudes of stimulation with 
3 nmol and 30 nmol orexin-A at the 2 hr time point were 
6- and 10-fold, respectively. The effect persisted at 4 hr; 
the amount of food consumed during the interval from 
2-4 hr postinjection was increased approximately 3-fold 
with either dose as compared to vehicle controls. Human 
orexin-B also significantly augmented food intake; at the 
2 hr time point, we observed 5- and 12-fold stimulation of 
food consumption by 3 nmol and 30 nmol orexin-B, 
respectively, as compared with vehicle controls (Figure 



(C) Low magnification of section caudal to Figure 5A showing the distribution of prepro-orexin mRNA-containing neurons. Mammillothalamic 
tract (mt), third ventricle (3V), cerebral peduncle (cp), infundibular stem (Infs). There are two groups of neurons: one group is seen in the 
posterior hypothalamus (PH) and the other in the lateral hypothalamus (LH). The yellow rectangle indicates the field shown in Figure 5D. 

(D) Higher magnification of the neuronal cell bodies in Figure 5C. The yellow vertical line indicates the boundary separating the medial neurons 
of PH and the neurons of LH. 

(E) Low magnification of prepro-orexin mRNA-containing neurons seen in parasagittal section traversing the posterior hypothalamic area 
shown in Figure 5D. Cerebral cortex (CC), dentate gyrus (DG), splenium of the corpus callosum (scc), lateral ventricle (LV), caudate putamen 
(Cpu), corpus callosum (cc), septum (S), anterior commissure (ac), thalamus (THAL), optic chiasm (ox), third ventricle (3V), pineal gland (Pi), 
superior colliculus (SC), inferior colliculus (IC), cerebellum (Cb), and pontine nucleus (Pn). The yellow rectangle indicates the field magnified in 
Figure 5F. 

(F) High magnification of the prepro-orexin-expressing neurons in the posterior hypothalamus (PH) shown in Figure 5E. 

(G) Orexin-A-immunoreactive neurons in the LH of a frozen parasagittal section. Dorsal aspect is to the left side of this panel. The yellow box 
corresponds to the field shown in Figure 5H. The staining of orexin-A-positive neurons was abolished in the presence of excess (100 nM) 
synthetic orexin-A. 

(H) High magnification of the immunoreactive orexin-containing neurons of the LH area. The boxed area of cells in Figure 5G has been rotated 
90° clockwise. In general, most of the neurons are multipolar, but a few are fusiform. The nucleus is not immunopositive, indicating that orexin 
is cytoplasmic. Beaded neuronal processes are sometimes seen in these sections (arrowhead). 

Figure 6. Stimulation of Food Consumption 
by Intracerebroventricular Injection of Or- 
exin-A and -B in Freely Fed Rats 
Designated amounts of synthetic human or- 
exin-A (left) or -B (right) were administered in 
a 5 µl bolus through a catheter placed in the 
left lateral ventricle in early light phase. Cu- 
mulative food consumption was plotted over 
the period of 4 hr after injection. Asterisks 
(*) indicate significant difference from vehicle 
controls (p < 0.05, n = 8-10, ANOVA followed 
by Student-Newman-Keuls test). Crosses (+) 
designate significant difference between 3 
nmol and 30 nmol injections (p < 0.05, n = 
8-10, ANOVA followed by Student-Newman- 
Keuls test). Similar results were obtained in 
at least four independent sets of experiments. 
The same vehicle control curve was replotted 
in both panels. 

6, right). The effects of orexin-B did not last as long as 
those of orexin-A; there was little stimulation of food 
intake after 2 hr even with the higher dose. These actions 
of orexin were less efficacious than NPY, a known orexi- 
genic neuropeptide (Gerald et al., 1996; Turton et al., 
1996), but longer lasting in the case of orexin-A; under 
the same conditions, we found that 3 nmol human NPY 
induced cumulative food consumption of 11.4 ± 2.0 g 
and 12.8 ± 1.8 g (mean ± SEM) at 2 hr and 4 hr intervals, 
respectively, representing 24- and 14-fold stimulation 
over vehicle controls. 
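
As a worked check on the NPY comparison, the reported cumulative intake and fold stimulation together imply an approximate vehicle-control intake at each time point. The sketch below only divides the reported means by the reported fold values; the function name is hypothetical, and the results are rough because the fold values in the text are rounded.

```python
def implied_vehicle_intake(treated_g: float, fold: float) -> float:
    """Vehicle intake (g) implied by a treated mean and its fold stimulation."""
    return treated_g / fold

# Reported NPY means (g) and fold stimulation at 2 hr and 4 hr.
at_2hr = implied_vehicle_intake(11.4, 24)
at_4hr = implied_vehicle_intake(12.8, 14)
print(f"2 hr: {at_2hr:.3f} g, 4 hr: {at_4hr:.3f} g")
```

The implied vehicle intake is on the order of 0.5 g at 2 hr and 0.9 g at 4 hr, consistent with the low baseline feeding expected in freely fed rats in early light phase.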

The shorter action of orexin-B as compared with or- 
exin-A might be related to the fact that orexin-B is a 
linear peptide with a free N terminus. In contrast, both 
termini of orexin-A are blocked by posttranslational 
modifications. Together with the two intrachain disulfide 
bonds, these modifications may render orexin-A resis- 
tant to inactivating peptidases. We cannot exclude the 
possibility that these time courses reflect bona fide dif- 
ferences in the biological actions of orexin-A and orexin-B 
in the central nervous system. The similar degrees of 
stimulation of food intake by both orexin-A and orexin-B 
suggest that OX 2 R, the nonselective receptor (see Figure 
3), may at least in part be involved in this in vivo pharma- 
cological action of orexins. However, more detailed in 
vivo dose-response studies, as well as a separate block- 
ade of these receptors either pharmacologically by se- 
lective antagonists or genetically by gene targeting, will 
be required to determine the receptor(s) involved. 

Up-Regulation of prepro-orexin mRNA 
in the Fasting State 

The stimulation of food intake by centrally administered 
orexins suggested that these neuropeptides may play 
a physiological role in the central regulation of feeding. 
To evaluate the possibility that orexin production and 
release may be affected by nutritional state, we com- 
pared the level of prepro-orexin mRNA expression in 
the hypothalamus of fed and fasted rats. Adult male rats 
were housed singly under 12 hr light-dark cycle for at 
least 5 days, and then either fed ad libitum (n = 5) or 
fasted for 48 hr starting at mid-light cycle (n = 5). At the 



end of the 48 hr period, rats were sacrificed and the 
thalamic/hypothalamic portion of the brain was dis- 
sected. Total RNA was extracted from individual sam- 
ples and subjected to quantitative Northern blot analy- 
sis. After the 48 hr fast, hypothalamic prepro-orexin 
mRNA was up-regulated 2.4-fold as compared with the 
fed control animals (Figure 7). NPY mRNA was pre- 
viously described to be up-regulated under similar 
fasted conditions (Qu et al., 1996). We also found that 
NPY mRNA was up-regulated, albeit to a lesser extent 
than the orexin mRNA (Figure 7A). 
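
The normalization behind a fold change such as the 2.4-fold value (target signal divided by the β-actin signal in each lane, with the fed-group mean set to unity, as described in the Figure 7B legend) can be sketched as follows. The densitometry readings below are hypothetical numbers chosen for illustration and are not data from this study; the function name is also an assumption.

```python
from statistics import mean

def fold_change(fed, fasted):
    """Fold change of the fasted group relative to the fed group.

    Each argument is a list of (target_signal, actin_signal) pairs, one per
    lane. Signals are normalized to actin within each lane, and the fed
    group mean is defined as 1.0.
    """
    fed_norm = [target / actin for target, actin in fed]
    fasted_norm = [target / actin for target, actin in fasted]
    baseline = mean(fed_norm)
    return mean(fasted_norm) / baseline

# Hypothetical readings in arbitrary units, for illustration only.
fed = [(10.0, 100.0), (12.0, 100.0), (8.0, 100.0)]
fasted = [(20.0, 100.0), (22.0, 100.0), (18.0, 100.0)]
print(round(fold_change(fed, fasted), 2))  # 2.0
```

Normalizing within each lane before averaging corrects for unequal RNA loading between lanes.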



Physiological Implications 

Quantum leaps in our understanding of the mechanisms 
involved in the energy homeostasis have resulted from 
the recent identification of the leptin signaling pathway 
(Zhang et al., 1994; Chen et al., 1996) and by additional 
molecular genetic studies in mice. These studies have 
revealed a number of central regulatory pathways medi- 
ated by various neuropeptides (Yen et al., 1994; Erickson 
et al., 1996a; Huszar et al., 1997; Ohki-Hamazaki et al., 
1997). Maintenance of body weight is achieved by an 
intricate balance between energy intake (food consump- 
tion) and expenditure. This energy homeostasis is ulti- 
mately governed by the brain, where a variety of afferent 
signals reflect the animal's nutritional state and its ex- 
ternal environment, particularly food availability. These 
signals are integrated in order to moderate efferent path- 
ways that control feeding behavior and energy expendi- 
ture (Bray, 1995; Rosenbaum et al., 1997). The afferent 
signals include somatic sensory signals like smell and 
taste, gastrointestinal signals like mechanical distention 
and chemical stimuli to the mucosa, blood-borne metab- 
olites like glucose and free fatty acids, and "lipostatic" 
signals such as leptin (Friedman, 1997). Together with 
the regulation of metabolic rate, which is in part medi- 
ated by sympathetic and parasympathetic nervous sys- 
tems and thyroid hormone, the regulation of feeding 
behavior constitutes the efferent wing of energy homeo- 
stasis. The central site of integration is the hypothalamus 
(Oomura, 1980; Bernardis and Bellinger, 1993, 1996), 

Figure 7. Up-Regulation of prepro-orexin mRNA in Fasted Rats 

(A) Northern blot analysis of hypothalamic prepro-orexin mRNA expression in freely fed and 48 hr fasted rats. Each lane contains 20 µg of 
total RNA from rat diencephalon. The membrane was rehybridized with NPY cDNA and then with β-actin cDNA as internal reference for the 
amount of RNA loaded. 

(B) Densitometric quantitation of Northern blot results presented in Figure 7A. The ratio of prepro-orexin and β-actin mRNA signals was 
determined in each lane, and these normalized values were then compared between the fasted and fed groups. The mean value for normalized 
prepro-orexin mRNA expression in the fed group was arbitrarily designated as unity. 



where a large array of neurotransmitters, especially neu- 
ropeptides, modulate signals through complex neural 
circuits. 

Recent molecular genetic studies in rodents have 
started to provide convincing evidence supporting im- 
portant roles for certain hypothalamic neuropeptides in 
the central regulation of energy balance (Wolf, 1997). For 
example, NPY, which is negatively regulated by leptin 
(Zarjevski et al., 1993; Stephens et al., 1995; Tartaglia 
et al., 1995), has been established as one of the positive 
regulators of feeding behavior, presumably acting on the 
Y5 NPY receptor (Gerald et al., 1996); thus, the obesity of 
leptin-deficient mice is partially ameliorated when they 
are also deficient in NPY (Erickson et al., 1996b). Ectopic 
and constitutive expression of the endogenous antago- 
nist for melanocortin receptors, the Agouti protein, is 
the basis of obesity in obese-yellow mice (Yen et al., 
1994; Fan et al., 1997), and, indeed, deficiency in the 
MC4 melanocortin receptor completely replicates the 
obesity syndrome of obese-yellow mice (Huszar et al., 
1997). Furthermore, Agouti-related protein (also termed 
ART), another endogenous antagonist for the MC4 re- 
ceptor, has recently been shown to be expressed in the 
hypothalamus, and the expression is up-regulated in 
ob/ob and db/db mice (Ollmann et al., 1997; Shutter et 
al., 1997). More recently, mice deficient for an orphan 
neuropeptide receptor (bombesin receptor type-3) have 
also been shown to become obese (Ohki-Hamazaki et 
al., 1997). Within the hypothalamus, two regions have 
traditionally been associated with the regulation of feed- 
ing and energy balance: the ventromedial hypothalamus 
and the lateral hypothalamic area. A classical series 
of lesion experiments led to the hypothesis that the 
ventromedial hypothalamus functions as a "satiety cen- 
ter," whereas the lateral hypothalamic area works as a 
"feeding center." Thus, a lateral hypothalamic lesion 



causes the animal to eat less and lose weight, some- 
times leading to death by starvation. While clearly over- 
simplified, these hypotheses still provide a valid concep- 
tual framework (Oomura, 1980). 

Recent studies continue to reveal the molecular basis 
for the role of periventricular/medial hypothalamic re- 
gions in energy homeostasis, e.g., ventromedial nu- 
cleus, arcuate nucleus, and paraventricular nucleus. 
Neurons containing neuropeptides such as NPY (Bing 
et al., 1996), melanocortins (Jacobowitz and O'Donohue, 
1978), glucagon-like peptide-1 (Shughrue et al., 1996), 
and galanin (Warembourg and Jolivet, 1993), as well as 
the leptin and melanocortin-4 receptors (Mountjoy et 
al., 1994; Tartaglia et al., 1995) are abundant in one 
or more of these periventricular/medial hypothalamic 
regions. In contrast, few neuropeptides have been de- 
scribed to be produced chiefly in the lateral hypotha- 
lamic regions. Other than the orexins, we are aware of 
only one distinct neurotransmitter that is specifically 
produced in the lateral hypothalamus: melanin concen- 
trating hormone (MCH) has been localized in the zona 
incerta and the lateral hypothalamic area (Bittencourt 
et al., 1992). Intriguingly, MCH was recently reported 
to stimulate food intake upon central administration. 
Moreover, MCH mRNA is up-regulated in ob/ob mice 
and by fasting in wild-type mice (Qu et al., 1996). It will 
be important to investigate further the possible interplay 
of orexin with this and other positive (e.g., Agouti-related 
protein, NPY, galanin, and opioids) and negative (e.g., 
leptin, melanocortins, corticotropin-releasing factor, glu- 
cagon-like peptide-1, and cholecystokinin) regulators 
of energy balance both within and outside the central 
nervous system (Arase et al., 1988; Rosenbaum et al., 
1997). 

A decline of blood glucose levels can signal the initia- 
tion of food intake (Oomura, 1980). The lateral hypotha- 
lamic area contains glucose-sensitive neurons that are 
activated by glucopenia and thus implicated in the posi- 
tive short-term regulation of feeding and energy expen- 
diture (Oomura et al., 1974). It is tempting to speculate 
that all or some of the orexin-containing neurons may be 
glucose-sensitive, or that they may receive stimulatory 
projections from glucose-sensitive neurons. Future ex- 
periments will also determine whether orexins have ad- 
ditional actions relevant to nutritional homeostasis, such 
as effects on the regulation of systemic energy expendi- 
ture, secretion of metabolic hormones such as insulin, 
and ultimately, the regulation of body weight. 

The present discovery of orexins and their receptors 
may provide a novel molecular basis for the role of the 
lateral hypothalamic areas in the regulation of feeding 
behavior. It is clear, however, that the definitive assign- 
ment of physiological roles for orexins requires further 
pharmacological as well as molecular genetic investiga- 
tions. Nevertheless, pharmacological intervention di- 
rected at the orexin receptors may prove to be an attrac- 
tive avenue toward the discovery of novel therapeutics 
for diseases involving disregulation of energy homeo- 
stasis, such as obesity and diabetes mellitus. 

Experimental Procedures 
Transfected Cells 

Full-length cDNAs encoding individual orphan GPCRs were sub- 
cloned into the pCDN mammalian expression vector. These plas- 
mids were individually transfected into the human embryonic kidney 
(HEK) 293 cells, and stable transfectant cell lines established and 
maintained as described (Aiyar et al., 1994). For in vitro pharmaco- 
logical assessments (Figure 3), CHO-K1 cells were stably trans- 
fected with human OX1R and OX2R expression vectors, and 
transfected cell lines were maintained as previously described (Xu 
et al., 1994). 

Purification of Orexin 

Approximately 200 g of frozen rat whole brain (150 pieces, Pel- 
Freeze) were homogenized by Polytron in 10 x volume of 70% (v/v) 
acetone, 1 M acetic acid, and 20 mM HCl. The homogenate was 
centrifuged at 20,000 x g for 30 min at 4°C. The resultant supernatant 
was collected and extracted three times with diethyl ether. The 
aqueous phase was centrifuged at 20,000 x g for 30 min at 4°C, 
and the supernatant was loaded onto two 10 g cartridges of Sep- 
Pak C18 (Waters), which were preequilibrated with 0.1% (v/v) TFA. 
Cartridges were washed with 5% CH3CN/0.1% TFA, and then eluted 
with 50% CH3CN/0.1% TFA. The eluate was lyophilized, redissolved 
in 1 M acetic acid with sonication, and then filtered through a 20 
µm Mirex GV filter (Millipore). Step 1 (Figure 1A): the extract was 
directly loaded onto a C18 reverse-phase HPLC column (Vydac 
218TP510; 10 mm x 250 mm), preequilibrated with 3% CH3CN/0.1% 
TFA at a flow rate of 3 ml/min at room temperature. A 10%-40% 
gradient of CH3CN in 0.1% TFA was then applied over 100 min. 
Three-milliliter fractions were collected, and ~2% of each fraction 
was set aside and assayed for [Ca2+]i transients in OX1R-expressing 
HEK293 cells (see below). Step 2 (Figure 1B): the active fractions 
were pooled and directly applied to a cation-exchange HPLC col- 
umn (TosoHaas SP-5PW; 7.5 mm x 75 mm) preequilibrated with 20 
mM Na-phosphate (pH 3.0)/30% CH3CN at room temperature. A 
0-0.5 M gradient of NaCl in 20 mM Na-phosphate (pH 3.0)/30% 
CH3CN was applied over 60 min at a flow rate of 1 ml/min. One- 
milliliter fractions were collected, and ~3% from each fraction was 
used for the [Ca2+]i assay. Step 3 (Figure 1C): the active fractions 
were pooled, diluted 4-fold with 0.1% TFA, and directly loaded onto 
an analytical C18 reverse-phase column (Vydac 218TP54; 4.6 mm x 
250 mm) preequilibrated with 3% CH3CN/0.1% TFA at a flow rate 
of 1 ml/min at room temperature. A 21%-36% gradient of CH3CN 
in 0.1% TFA was applied over 75 min. Individual UV absorption peaks 
(210 nm) were collected manually, and ~3% from each fraction was 



assayed. Step 4 (Figure 1D): the active peak was diluted 4-fold with 
0.1% TFA and directly loaded onto the same C18 column, but this 
time preequilibrated at 40°C with 3% CH3CN/0.1% TFA. A 21%-36% 
gradient of CH3CN in 0.1% TFA was applied over 75 min at 40°C. 
The major 210 nm peak, representing virtually pure orexin-A, was 
collected manually. For purification of rat orexin-B and orexin-B[3- 
28] as well as bovine orexin-A, basically the same procedures were 
used, except that step 3 was replaced with a reverse-phase HPLC 
under neutral pH: active fractions from step 2 were directly loaded 
onto the same analytical C18 column preequilibrated with 3% CH3CN/ 
20 mM Tris-HCl (pH 7.0 at 40°C). A 3%-40% gradient of CH3CN in 
20 mM Tris-HCl (pH 7.0) was applied over 74 min at 40°C. Individual 
210 nm peaks were collected manually, and the active fraction was 
subjected to step 4 as described above. 
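
The acetonitrile gradients in the reverse-phase steps above can be compared by their slopes, which indicate how much shallower (and hence more resolving) the analytical steps are than the initial fractionation. The sketch below uses only the gradient endpoints and durations stated in the text; the function name and step labels are hypothetical.

```python
def gradient_slope(start_pct: float, end_pct: float, minutes: float) -> float:
    """Change in % CH3CN per minute for a linear gradient."""
    return (end_pct - start_pct) / minutes

# (start % CH3CN, end % CH3CN, duration in min) as stated in the procedure.
GRADIENTS = {
    "step 1, preparative C18": (10, 40, 100),
    "step 3, analytical C18": (21, 36, 75),
    "step 4, analytical C18 at 40 C": (21, 36, 75),
    "neutral-pH variant of step 3": (3, 40, 74),
}

for label, (start, end, minutes) in GRADIENTS.items():
    print(f"{label}: {gradient_slope(start, end, minutes):.2f} %/min")
```

Steps 3 and 4 run at 0.2 %/min compared with 0.3 %/min in step 1, while the neutral-pH variant spans a wider range at 0.5 %/min.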

Peptide Microsequencing 

Direct sequencing of the intact orexin-A peptide by Edman degrada- 
tion failed, suggesting a blocked amino terminus. The monoisotopic 
molecular mass of the intact peptide was determined to be 3558,7 ± 
0.1 Da by time lag-focusing matrix-assisted laser desorption/ioniza- 
tion (MALDI) mass spectrometry (Carr and Annan, 1996) using inter- 
nal peptide standards (Micromass TofSpec-SE). The measured 
mass of the peptide increased by 232 Da upon reaction with iodo- 
acetamide in the presence of DTT, indicating the presence of two 
intramolecular disulfide bonds. Digestion of the reduced and alkyl-
ated peptide with Lys-C provided two peptides with average Mr =
1287 and 2530. Edman sequence of the mixture provided a single 
partial sequence of TCSCRLYELLHGAGNHAAG. Edman sequence 
analysis of an Asn-C digest mixture extended the C-terminal se-
quence as follows: ...HAAGILTL. The C-terminal sequence,
...EXXHGAGNHAAGXXTX (where X is either Leu or Ile), was also confirmed
by MALDI mass spectrometry of the peptide partially digested with 
carboxypeptidase Y (Voyager Elite XL, PerSeptive Biosystems). The 
blocked N-terminal Lys-C peptide was sequenced by nanoelectro- 
spray collision induced dissociation (CID) tandem mass spectrome- 
try (Carr et al., 1996; Wilm and Mann, 1996) on a triple quadrupole 
mass spectrometer (Perkin-Elmer Sciex API-III). A sequence of
<EPXPDCCRQK (calculated MH+ = 1286.6) was determined from
the tandem MS data, where <E is pyroglutamic acid, and X is either
Leu or Ile. The identity and sequence of the first two residues were
confirmed by MS 1 of the b2 ion produced by in-source CID. X was 
determined to be Leu by amino acid analysis of the HPLC purified 
N-terminal peptide. The Gln residue at position 9 was distinguished
from Lys (both amino acids have the same residue mass) by acetyla-
tion of the peptide and remeasurement of the molecular weight. The
MH+ ion for the ligand shifted by 42 Da from 1286.6 to 1328.6,
indicating the addition of only one acetate group (calculated MH+ =
1328.6). Because Gln residues cannot be acetylated and the N termi-
nus is blocked, the addition of only one acetate group strongly
suggests the C-terminal sequence is QK, not KK. The Mr determined
for the intact peptide (3558.7 ± 0.1 Da) indicates that the C terminus
is an amide rather than a free acid and that all four Cys are in
disulfide linkages (calculated monoisotopic Mr = 3558.7).
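
The mass bookkeeping used above can be checked with a few lines of arithmetic. The following is a minimal sketch in Python and is not part of the original analysis. It assumes the standard nominal shifts of 58 Da per Cys for reduction plus carbamidomethylation and 42 Da per acetyl group; the function names are hypothetical.

```python
# Nominal mass shifts in Da. These are assumed standard values,
# not numbers taken from the text above.
ALKYLATION_PER_CYS = 58  # +57 carbamidomethyl, +1 H gained on reducing the disulfide
ACETYL = 42              # one acetyl group (C2H2O)


def cysteines_from_alkylation_shift(shift_da):
    """Number of Cys residues alkylated, given the observed mass increase."""
    count, remainder = divmod(shift_da, ALKYLATION_PER_CYS)
    if remainder:
        raise ValueError("shift is not a whole number of alkylated cysteines")
    return count


def acetyl_groups_from_shift(mh_before, mh_after):
    """Number of acetyl groups added, from MH+ before and after acetylation."""
    return round((mh_after - mh_before) / ACETYL)


n_cys = cysteines_from_alkylation_shift(232)           # observed +232 Da
n_disulfides = n_cys // 2                              # two Cys per disulfide
n_acetyl = acetyl_groups_from_shift(1286.6, 1328.6)    # observed MH+ values
print(n_cys, n_disulfides, n_acetyl)
```

With the values reported above, the sketch returns four Cys, two disulfides, and one acetyl group, which is consistent with the QK assignment for the Lys-C peptide.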

Cloning of Rat prepro-orexin cDNA 

A part of the orexin-A sequence, QPLPDCCRQKTCSCRLYELLHGA
GNHAAGI (amino acids 1-30), was chosen to design highly degener-
ate oligonucleotide primers encoding its ends: 5'-CA(A,G)CCN(C,T)T
NCCNGA(C,T)TG(C,T)TG-3' and 5'-ATNCCNGCNGC(A,G)TG(A,G)TT-
3'. A cDNA fragment correctly encoding the peptide was obtained by
RT-PCR using rat brain poly(A) RNA as template. A nondegenerate 
oligonucleotide primer, 5'-GTTGCCAGCTCCGTGCAACAGTTCGTA 
GAGACGG-3', was designed based on the sequence of the RT-PCR 
product. Double-stranded cDNA was synthesized from rat brain 
poly(A) RNA, ligated to the Marathon adaptor (Clontech), and used 
as template for the initial 5'-RACE reaction with Adaptor Primer 1
(Clontech) and the above specific primer. A nested PCR reaction 
was performed using an oligonucleotide, 5'-CGGCAGGAACACGTC 
TTCTGGCG-3', and the Adaptor Primer 2 (Clontech). An approxi- 
mately 250 bp 5' cDNA product, correctly encoding the peptide, 
was obtained. Single-stranded rat brain cDNA was then synthesized 
using an oligonucleotide, 5'-CCTCTGAAGGTTCCAGAATCGATAG 
(T)25(A,C,G)N-3', as a specific primer for the reverse transcription.
This was used as template for a 3'-RACE reaction using a specific
primer, 5'-TCCTTGGGTATTTGGACCACTGCACCGAAG-3' (corre-
sponding to a part of the 5'-noncoding region of the cDNA sequence
obtained by the 5'-RACE), and an anchor primer, 5'-CCTCTGAAGGT
TCCAGAATCGATAG-3'. The product was subjected to a nested
PCR reaction using an oligonucleotide, 5'-ATACCATCTCTCCGGAT
TGCCTCTCCCTGA-3', and the same anchor primer. A discrete
0.6 kb product containing the correct 5' cDNA sequence was ob-
tained. The full-length cDNA sequence was determined by rese-
quencing the products obtained from three independent 3'-RACE
reactions.
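
The degenerate primer notation used in this section, where N stands for any base and (A,G) for one of the listed bases, can be parsed mechanically. The following Python sketch is an illustration only and is not part of the original methods; the function names are hypothetical. It counts how many distinct oligonucleotides each degenerate primer represents, and checks that the first 20 nt of the nondegenerate rat primer 5'-CAGCCTCTGCCCGACTGCTGT-3' (given in the following section) fall within the forward primer pool.

```python
import re


def parse_degenerate(primer):
    """Turn 'CA(A,G)CCN...' into a list of sets of allowed bases per position."""
    positions = []
    for group, single in re.findall(r"\(([ACGT,]+)\)|([ACGTN])", primer):
        if group:                      # e.g. "(A,G)" -> {"A", "G"}
            positions.append(set(group.split(",")))
        elif single == "N":            # N = any of the four bases
            positions.append(set("ACGT"))
        else:                          # an unambiguous base
            positions.append({single})
    return positions


def degeneracy(primer):
    """Number of distinct sequences encoded by a degenerate primer."""
    total = 1
    for allowed in parse_degenerate(primer):
        total *= len(allowed)
    return total


def matches(primer, sequence):
    """True if a concrete sequence is one member of the degenerate pool."""
    positions = parse_degenerate(primer)
    return len(positions) == len(sequence) and all(
        base in allowed for base, allowed in zip(sequence, positions)
    )


FORWARD = "CA(A,G)CCN(C,T)TNCCNGA(C,T)TG(C,T)TG"
REVERSE = "ATNCCNGCNGC(A,G)TG(A,G)TT"

print(degeneracy(FORWARD), degeneracy(REVERSE))
print(matches(FORWARD, "CAGCCTCTGCCCGACTGCTG"))
```

Under this reading of the notation, the forward primer is a 1024-fold degenerate 20-mer and the reverse primer a 256-fold degenerate 17-mer.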



Cloning of Human and Mouse prepro-orexin Genes 
Since we found that the full-length rat prepro-orexin cDNA, which 
contains CTG triplet repeats (encoding the oligo-leucine stretch in 
the signal sequence), tends to cross-hybridize with a number of 
unrelated genes, we produced a cDNA probe that did not contain 
the repeats by RT-PCR from rat brain RNA, using primers 5'-CAGCC
TCTGCCCGACTGCTGT-3' and 5'-CGTCTTTATTGCCTAGGGACTG
GCGAGGAG-3'. Human and mouse genomic libraries (Clontech and
Stratagene, respectively) were screened by plaque hybridization to
this cDNA probe. Resulting genomic fragments containing the entire 
prepro-orexin coding sequences were further characterized and 
sequenced by standard procedures. 

Cloning of OX1R and OX2R cDNAs

Cloning of full-length human OX1R (HFGAN72) cDNA has been dis-
closed elsewhere (US Patent No. WO96/34877; November 7, 1996)
(Figure 2C). Rat OX1R cDNA was obtained by screening rat brain
cDNA libraries with the full-length human OX1R cDNA as probe,
using standard procedures. In order to clone full-length human OX2R
cDNA, an inter-EST RT-PCR was performed with human brain
poly(A)+ RNA as template, using oligonucleotide primers based on
the public 5' and 3' EST sequences (GenBank accession numbers 
D81887 and W86548, respectively). The resultant cDNA product, 
which contained a large part of the coding region spanning the 
transmembrane domains 1-7, was further extended by standard 5'-
RACE and 3'-RACE procedures. Rat OX2R cDNA was obtained by
screening rat brain cDNA libraries with the full-length human OX2R
cDNA as probe. 

Radiation Hybrid Mapping 

The following oligonucleotides were used to amplify genomic frag-
ments: prepro-orexin, GCCAAAGGTGTCTCCACTC and GACAGGT
GCAAACGGAGCAC; OX1R, GGTGCTCACCAGCGTCACCAC and
GCTTAATCCTCACATCAACCCTGC; OX2R, GAGGAGCTTGCAGCAT
TGAGC and GTGCAGGTATTCCCTCCACAG. PCR of the prepro-
orexin, OX1R, and OX2R genes was performed in duplicate against
the 96 samples (including 3 controls: donor DNA, host DNA, and
water) in the GeneBridge 4 radiation hybrid panel (Research Genet-
ics) (Gyapay et al., 1996). The 93-character scoring strings of 1's
(presence), 0's (absence), and 2's (ambiguous) were submitted to the
Radiation Hybrid Mapping Server at MIT (http://www.genome.wi.mit.edu/)
(Hudson et al., 1995). Cytogenetic locations were inferred from these
results using the markers shown to neighbor the genes via The
Genome Directory (1995), and GDB
(http://gdbwww.gdb.org/gdb/docs/gdbhome.html). Chromosomal
locations were confirmed by
PCR on the BIOS panel (BIOS, USA) using the same oligo pairs and
conditions.
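
As an illustration of how duplicate PCR results could be reduced to a scoring string of the kind described above, the following Python sketch combines two replicate calls per hybrid line. This is an assumption for illustration only: the text does not state how ambiguous calls were assigned, so the rule that discordant duplicates are scored "2" is hypothetical, as are the function names and the example data.

```python
def score_hybrid(rep1, rep2):
    """Combine duplicate PCR calls (True = product present) into one score."""
    if rep1 and rep2:
        return "1"   # present in both replicates
    if not rep1 and not rep2:
        return "0"   # absent in both replicates
    return "2"       # assumption: discordant duplicates scored as ambiguous


def scoring_string(duplicates):
    """Build the scoring string from (replicate 1, replicate 2) pairs."""
    return "".join(score_hybrid(a, b) for a, b in duplicates)


# Hypothetical calls for five hybrid lines; a real string would cover
# all 93 hybrids of the panel, in panel order.
example = [(True, True), (False, False), (True, False),
           (False, False), (True, True)]
print(scoring_string(example))  # 10201
```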



Radioligand Binding Assay 

Synthetic human orexin-A was 125I-labeled at Tyr17 by Chlora-
mine-T oxidation in the presence of Na125I (2,000 Ci/mmol, New
England Nuclear), and monoiodinated peptide was purified by C18
reverse-phase HPLC as described (Takigawa et al., 1995). Stable
transfectant CHO cell lines expressing human OX1R or OX2R were
each seeded onto 12-well plates at a density of 3 x 10^5 cells per
well. After an overnight culture, medium was discarded and cells 
were incubated at 20°C for 90 min with binding buffer (HEPES-
buffered saline/0.5% bovine serum albumin) containing 10^-10 M
[125I]orexin-A plus designated concentrations of unlabeled competi-
tors. Cells were then washed three times with ice-cold phosphate-
buffered saline, lysed in 0.1 N NaOH, and cell-bound radioactivity
was determined by a γ-counter.

Intracellular Calcium Transient Assay 

Stable transfectant HEK293 cells expressing orphan GPCRs were
loaded with Fura-2 AM in suspension, and [Ca2+]i transients evoked
by agonists were monitored by a Model CAF-110 Intracellular Ion
Analyser (JASCO) in a 500 µl cuvette as previously described (Grynkie-
wicz et al., 1985; Xu et al., 1994). For in vitro pharmacological charac-
terization (Figure 3), the same procedures were performed using
stably transfected CHO cells expressing human OX1R or OX2R.

In Situ Hybridization 

A 0.29 kb segment of rat cDNA encoding Gln33-Ser128 of prepro- 
orexin was generated as described above and subcloned into 
pBluescript II SK(+) vector. Sense and anti-sense riboprobes were 
generated with T7 and T3 RNA polymerases, respectively, using the 
Maxiscript kit (Ambion) in the presence of 35S-CTP (Amersham).
In situ hybridization to adult rat brain sections was performed as 
described (Benjamin et al., 1997). 

Immunohistochemistry 

Anti-orexin-A antiserum was raised in rabbits by immunization with 
synthetic orexin-A[14-33], CRLYELLHGAGNHAAGILTL-amide, con-
jugated to keyhole limpet hemocyanin (Calbiochem) using 3-malei-
midobenzoic acid N-hydroxysuccinimide ester (Sigma). Antiserum
was affinity purified on an orexin-A-conjugated Sepharose CL-4B 
(Pharmacia) column and used for immunohistochemistry. Male 
Sprague-Dawley rats (~300 g) were anesthetized and perfused via 
the left cardiac ventricle with phosphate-buffered saline (PBS). Brain 
was taken out, directly embedded in OCT compound (Tissue-Tek),
and frozen in liquid nitrogen. Cryostat sections (15 µm) were cut
and mounted on silane-coated glass slides. The sections were post-
fixed with 4% paraformaldehyde in 0.1 M phosphate buffer (pH 7.2)
for 1 hr and washed three times in PBS. The sections were incubated 
with 1% bovine serum albumin in PBS for 1 hr and then incubated 
with affinity-purified antiserum in the same solution for 1 hr at room 
temperature. After washing three times in PBS, the sections were 
incubated with FITC-conjugated goat anti-rabbit IgG antibody (Cap- 
pel) for 1 hr at room temperature. Slides were then washed three 
times in PBS and examined under a fluorescence microscope.

Intracerebroventricular Administration of Orexins 
Male Wistar rats (180-200 g on arrival; Charles River) were housed
under controlled lighting (12 hr light-dark cycle) and temperature 
(22°C) conditions. Food (standard chow pellets) and water were 
available ad libitum. Rats (200-220 g) were anesthetized with pento-
barbital (50 mg/kg i.p.), positioned in a Kopf Model 900 stereotaxic
frame, and implanted with a guide cannula into the left lateral ventri- 
cle under sterile conditions using a MEDIBIO Optical Brain Tracer 
(Muromachi Kikai) (Ikeda and Matsushita, 1980). The coordinates 
used to map the correct positioning of the implants were: 6.1 mm 
anterior to the lambda, 1.5 mm lateral from the midline, and ~3.4
mm (guided by MEDIBIO) ventral to the skull surface, with the incisor 
bar set 3.3 mm below the interauricular line. Rats were then housed 
singly under the same conditions as above for a recovery period of 
at least 7 days, and body weights were monitored daily for the 
duration of the study. After recovery from surgery, rats were trans-
ferred to grid-floor cages and fed with powdered chow so that food
intake measurements could be made. The rats were acclimated to
the new environment for at least 1 day. The position of the cannula
was verified by central administration of human NPY (3 nmol in 
sterile water); for a positive test, at least 8 g of food was eaten over 
a 4 hr period postinjection. Only positively testing animals (n = 8-10) 
were used. The studies were conducted according to a multidose, 
crossover design, with the order of dosing determined using the 
Latin square principle, leaving at least one rest day between admin-
istrations. All doses were delivered in a volume of 5 µl in sterile
water over 30 s, and the injector remained in position for a further
30 s to ensure complete dispersal of the peptide. All intracerebro-
ventricular administrations began at 2 h into the light cycle, and food
intake was measured at 1, 2, and 4 h intervals. All peptides were
dissolved in sterile water, initially at 6 mM, and diluted in water as 
needed. Water alone was used for the vehicle control. 
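
A dosing order based on the Latin square principle can be sketched as follows. This Python example is illustrative only: the text does not say which Latin square was used, so the cyclic construction, the function name, and the treatment labels are all assumptions. A cyclic square guarantees that each treatment appears once per animal and once per session, but it does not balance carryover between consecutive treatments.

```python
def latin_square(treatments):
    """Cyclic Latin square: row i is the treatment list rotated by i places."""
    n = len(treatments)
    return [[treatments[(row + col) % n] for col in range(n)]
            for row in range(n)]


# Hypothetical treatment labels; the actual doses are given in the Results.
doses = ["vehicle", "dose 1", "dose 2", "dose 3"]

# Each row is the dosing order for one animal (or one group of animals);
# each column is one administration session.
for animal, order in enumerate(latin_square(doses), start=1):
    print(f"animal {animal}: " + " -> ".join(order))
```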